
How does Kubernetes' scheduler work?

Tags:

kubernetes

How does Kubernetes' scheduler work? From what I can see, it appears to be very simple.

My initial thought is that this scheduler is just a simple admission control system, not a real scheduler. Is that correct?

I found a short description, but it is not terribly informative:

The kubernetes scheduler is a policy-rich, topology-aware, workload-specific function that significantly impacts availability, performance, and capacity. The scheduler needs to take into account individual and collective resource requirements, quality of service requirements, hardware/software/policy constraints, affinity and anti-affinity specifications, data locality, inter-workload interference, deadlines, and so on. Workload-specific requirements will be exposed through the API as necessary.

Asked Mar 04 '15 by Halacs




1 Answer

The paragraph you quoted describes where we hope to be in the future (where the future is defined in units of months, not years). We're not there yet, but the scheduler does have a number of useful features already, enough for a simple deployment. In the rest of this reply, I'll explain how the scheduler works today.

The scheduler is not just an admission controller; for each pod that is created, it finds the "best" machine for that pod, and if no machine is suitable, the pod remains unscheduled until a machine becomes suitable.

The scheduler is configurable. It has two types of policies, FitPredicate (see master/pkg/scheduler/predicates.go) and PriorityFunction (see master/pkg/scheduler/priorities.go). I'll describe them.

Fit predicates are required rules. For example: the labels on the node must be compatible with the label selector on the pod (implemented in PodSelectorMatches() in predicates.go), and the sum of the resources requested by the container(s) already running on the machine plus the resources requested by the new container(s) you are considering scheduling onto the machine must not exceed the machine's capacity (implemented in PodFitsResources() in predicates.go; note that "requested resources" is defined as pod.Spec.Containers[n].Resources.Limits, and a pod that requests zero resources always fits). If any required rule is not satisfied for a particular (new pod, machine) pair, the new pod is not scheduled on that machine. If, after checking all machines, the scheduler decides that the new pod cannot be scheduled onto any machine, the pod remains in the Pending state until one of the machines can satisfy it.
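To make the resource-fit rule concrete, here is a minimal sketch of a PodFitsResources-style predicate. The types and field names are simplified stand-ins, not the real Kubernetes API:

```go
package main

import "fmt"

// Simplified stand-ins for the real api.Pod and api.Node types.
type Pod struct {
	Name         string
	RequestedCPU int // millicores; zero means "always fits", as noted above
	RequestedMem int // MiB
}

type Node struct {
	Name        string
	CapacityCPU int
	CapacityMem int
	Pods        []Pod // pods already scheduled on this node
}

// podFitsResources mimics the PodFitsResources() predicate: the sum of
// resources requested by pods already on the node, plus the new pod's
// requests, must not exceed the node's capacity.
func podFitsResources(pod Pod, node Node) bool {
	usedCPU, usedMem := 0, 0
	for _, p := range node.Pods {
		usedCPU += p.RequestedCPU
		usedMem += p.RequestedMem
	}
	return usedCPU+pod.RequestedCPU <= node.CapacityCPU &&
		usedMem+pod.RequestedMem <= node.CapacityMem
}

func main() {
	node := Node{Name: "node-1", CapacityCPU: 1000, CapacityMem: 2048,
		Pods: []Pod{{Name: "running", RequestedCPU: 600, RequestedMem: 1024}}}
	fmt.Println(podFitsResources(Pod{Name: "small", RequestedCPU: 300, RequestedMem: 512}, node)) // true
	fmt.Println(podFitsResources(Pod{Name: "big", RequestedCPU: 500, RequestedMem: 512}, node))   // false: 600+500 > 1000
}
```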

After checking all of the machines with respect to the fit predicates, the scheduler may find that multiple machines "fit" the pod. But of course, the pod can only be scheduled onto one machine. That's where priority functions come in. Basically, the scheduler ranks the machines that meet all of the fit predicates, and then chooses the best one. For example, it prefers the machine whose already-running pods consume the least resources (this is implemented in LeastRequestedPriority() in priorities.go). This policy spreads pods (and thus containers) out instead of packing lots onto one machine while leaving others empty.
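The spreading behaviour of LeastRequestedPriority() can be sketched like this. The scoring formula (0-10 per resource, averaged over CPU and memory) matches the general idea; the type and function names here are illustrative, not the real API:

```go
package main

import "fmt"

// Simplified node view: total capacity and the sum of resources already
// requested by pods scheduled there.
type NodeInfo struct {
	Name        string
	CapacityCPU int
	UsedCPU     int
	CapacityMem int
	UsedMem     int
}

// leastRequestedScore follows the idea of LeastRequestedPriority():
// nodes with more free capacity score higher (0-10, averaged over CPU
// and memory), so pods spread out instead of packing one machine.
func leastRequestedScore(n NodeInfo) int {
	cpu := 10 * (n.CapacityCPU - n.UsedCPU) / n.CapacityCPU
	mem := 10 * (n.CapacityMem - n.UsedMem) / n.CapacityMem
	return (cpu + mem) / 2
}

// pickNode ranks the nodes that passed all fit predicates and returns
// the highest-scoring one.
func pickNode(nodes []NodeInfo) NodeInfo {
	best := nodes[0]
	for _, n := range nodes[1:] {
		if leastRequestedScore(n) > leastRequestedScore(best) {
			best = n
		}
	}
	return best
}

func main() {
	nodes := []NodeInfo{
		{Name: "busy", CapacityCPU: 1000, UsedCPU: 800, CapacityMem: 2048, UsedMem: 1536},
		{Name: "idle", CapacityCPU: 1000, UsedCPU: 100, CapacityMem: 2048, UsedMem: 256},
	}
	fmt.Println(pickNode(nodes).Name) // idle
}
```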

When I said that the scheduler is configurable, I mean that you can decide at compile time which fit predicates and priority functions you want Kubernetes to apply. Currently, it applies all of the ones you see in predicates.go and priorities.go.
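Putting the two phases together, the overall loop looks roughly like the following sketch: run every compiled-in fit predicate against every machine, then rank the survivors with the priority functions. The function types and names are hypothetical, for illustration only:

```go
package main

import "fmt"

type Pod struct {
	Name string
	CPU  int
}

type Node struct {
	Name    string
	FreeCPU int
}

// The scheduler is wired up with whichever predicates and priority
// functions were chosen at compile time.
type FitPredicate func(Pod, Node) bool
type PriorityFunc func(Pod, Node) int

// schedule mirrors the two-phase process described above: filter with
// the required rules, then rank what remains. Returns "" when no node
// fits, i.e. the pod would stay Pending.
func schedule(pod Pod, nodes []Node, preds []FitPredicate, prios []PriorityFunc) string {
	bestName, bestScore := "", -1
Nodes:
	for _, n := range nodes {
		for _, fits := range preds {
			if !fits(pod, n) {
				continue Nodes // a required rule filtered this node out
			}
		}
		score := 0
		for _, prio := range prios {
			score += prio(pod, n)
		}
		if score > bestScore {
			bestName, bestScore = n.Name, score
		}
	}
	return bestName
}

func main() {
	preds := []FitPredicate{func(p Pod, n Node) bool { return p.CPU <= n.FreeCPU }}
	prios := []PriorityFunc{func(p Pod, n Node) int { return n.FreeCPU }}
	nodes := []Node{{"node-a", 200}, {"node-b", 900}}
	fmt.Println(schedule(Pod{"web", 300}, nodes, preds, prios)) // node-b
}
```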

Answered Sep 29 '22 by DavidO