Kubeadm and the Risks of Scheduling Pods on Master Node (Pods always Pending)

Tags: kubernetes, centos7, kubeadm

While following the Kubernetes article Using kubeadm to Create a Cluster, I got stuck: the AddOn pods I was trying to install (Nginx, Tiller, Grafana, InfluxDB, Dashboard) always stayed in the Pending state.

Running kubectl describe pod tiller-deploy-df4fdf55d-jwtcz --namespace=kube-system showed the following event:

Type     Reason            Age                From               Message
----     ------            ----               ----               -------
Warning  FailedScheduling  51s (x15 over 3m)  default-scheduler  0/1 nodes are available: 1 node(s) had taints that the pod didn't tolerate.
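
The event above says the only node carries a taint the pod does not tolerate. A minimal sketch for checking which taints each node has, assuming kubectl is already configured for the cluster; the function name list_node_taints is made up for illustration:

```shell
# Print one row per node with the keys and effects of its taints.
# On a kubeadm 1.10 master the key is expected to be
# node-role.kubernetes.io/master with effect NoSchedule.
list_node_taints() {
  kubectl get nodes \
    -o custom-columns='NAME:.metadata.name,TAINT_KEYS:.spec.taints[*].key,EFFECTS:.spec.taints[*].effect'
}
```

Calling list_node_taints on a single-node cluster should show the master as the only row, which is why every pod without a matching toleration stays Pending.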

When I ran the command from the Master Isolation section, kubectl taint nodes --all node-role.kubernetes.io/master-, the AddOns installed as expected.

At this point I can only suspect (because the AddOns are now installed on the master node) that the cause was that I hadn't yet joined a worker node to the cluster, so the scheduler had nowhere to place the pods.

The documentation states "your cluster will not schedule pods on the master for security reasons". I know that this is a non-production environment, so there is little risk in this situation, but what is the risk of removing that taint in a production cluster?

Follow-up: If this is a risk, how can I re-add that taint so I can then uninstall the AddOn pods and try to have the scheduler install them on my Worker Node?

Environment Details:
Operating System - CentOS 7.4.1708 (Core)
Kubernetes Version - 1.10

asked Apr 06 '18 13:04

Flea

1 Answer

"the reason was that I hadn't connected a worker node to the cluster yet for the scheduler to schedule the pods on."

100% correct. You will for sure want some worker nodes, otherwise the idea of "scheduling work" becomes very weird.

"but what is the risk of removing that taint in a production cluster?"

I am not a Kubernetes security expert, but a pragmatic risk is CPU, I/O, and/or memory exhaustion on the master nodes, which would have very severe consequences for the health of the cluster. There is almost never a reason to run any workload on a master node, and doing so almost always only adds risk, so the advice "just don't do it" is well founded.

"how can I re-add that taint so I can then uninstall the AddOn pods and try to have the scheduler install them on my Worker Node?"

I'm not sure I follow that question, but I would for sure start by just adding a worker node before trying to do complicated stuff with taints and tolerations.
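
For reference, once a worker node has joined, putting the taint back could look roughly like the sketch below. It assumes the taint key from the question (node-role.kubernetes.io/master) with the NoSchedule effect; the function names and the node name argument are placeholders, not something from the original post:

```shell
# Re-apply the taint that
# `kubectl taint nodes --all node-role.kubernetes.io/master-` removed.
# $1: the master's node name as shown by `kubectl get nodes`.
retaint_master() {
  kubectl taint nodes "$1" node-role.kubernetes.io/master=:NoSchedule
}

# NoSchedule only affects future scheduling decisions, so pods already
# running on the master stay there until they are deleted. Deleting a pod
# owned by a Deployment makes its controller create a replacement, which
# the scheduler must then place on a node without the taint.
# $1: pod name, $2: namespace.
reschedule_pod() {
  kubectl delete pod "$1" --namespace="$2"
}
```

For example, retaint_master followed by reschedule_pod tiller-deploy-df4fdf55d-jwtcz kube-system. If no worker node is available at that point, the replacement pod goes back to Pending with the same FailedScheduling event.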

answered Oct 05 '22 04:10

mdaniel
