I want to run a CronJob on my GKE in order to perform a batch operation on a daily basis. The ideal scenario would be for my cluster to scale to 0 nodes when the job is not running and to dynamically scale to 1 node and run the job on it every time the schedule is met.
I am first trying to achieve this with the simple CronJob from the Kubernetes docs that only prints the current time and terminates.
I first created a cluster with the following command:
gcloud container clusters create $CLUSTER_NAME \
--enable-autoscaling \
--min-nodes 0 --max-nodes 1 --num-nodes 1 \
--zone $CLUSTER_ZONE
Then, I created a CronJob with the following description:
apiVersion: batch/v1beta1
kind: CronJob
metadata:
  name: hello
spec:
  schedule: "1 * * * *"
  jobTemplate:
    spec:
      template:
        spec:
          containers:
          - name: hello
            image: busybox
            args:
            - /bin/sh
            - -c
            - date; echo Hello from the Kubernetes cluster
          restartPolicy: Never
The job is scheduled to run every hour and to print the current time before terminating.
First, I wanted to create the cluster with 0 nodes, but setting --num-nodes 0
results in an error. Why is that? Note that I can manually scale the cluster down to 0 nodes after it has been created.
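For reference, this is the manual scale-down I mean (assuming a single default node pool), using the standard resize command:
gcloud container clusters resize $CLUSTER_NAME \
    --num-nodes 0 \
    --zone $CLUSTER_ZONE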
Second, if my cluster has 0 nodes, the job won't be scheduled because the cluster does not scale to 1 node automatically but instead gives the following error:
Cannot schedule pods: no nodes available to schedule pods.
Third, if my cluster has 1 node, the job runs normally, but afterwards the cluster does not scale back down to 0 nodes and stays at 1 node instead. I let the cluster run through two successive jobs and it did not scale down in between; I assume one hour should be long enough for it to do so.
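In case it helps, one way to see which pods are still running on the remaining node (the node name below is a placeholder for whatever kubectl get nodes returns) is:
kubectl get nodes
kubectl get pods --all-namespaces -o wide \
    --field-selector spec.nodeName=<node-name>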
What am I missing?
EDIT: I've got it to work and detailed my solution here.
Note: If you specify a minimum of zero nodes, an idle node pool can scale down completely. In GKE version 1.21 and earlier, at least one node must always be available in the cluster to run system Pods. In GKE version 1.22 and later, the cluster autoscaler and node auto-provisioning support scaling down to zero nodes.
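So on a suitable GKE version you can allow an existing pool to scale to zero after the fact. A sketch, assuming the pool has the default name default-pool:
gcloud container clusters update $CLUSTER_NAME \
    --enable-autoscaling \
    --min-nodes 0 --max-nodes 1 \
    --node-pool default-pool \
    --zone $CLUSTER_ZONE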
Update:
Note: Beginning with Kubernetes version 1.7, you can specify a minimum size of zero for your node pool. This allows your node pool to scale down completely if the instances within aren't required to run your workloads.
https://cloud.google.com/kubernetes-engine/docs/concepts/cluster-autoscaler
Old answer:
Scaling the entire cluster to 0 is not supported, because you always need at least one node for system pods:
See docs
You could create one node pool with a small machine for system pods, and an additional node pool with a big machine where you would run your workload. This way the second node pool can scale down to 0 and you still have space to run the system pods.
After trying this out, @xEc adds: "Also note that there are scenarios in which my node pool wouldn't scale, like if I created the pool with an initial size of 0 instead of 1."
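A rough sketch of that two-pool setup (the pool names and machine types are just examples; note the initial size of 1, per the comment above):
# Small pool that keeps running the system pods
gcloud container node-pools create system-pool \
    --cluster $CLUSTER_NAME \
    --machine-type e2-small \
    --num-nodes 1 \
    --zone $CLUSTER_ZONE
# Larger pool for the batch workload, allowed to scale down to 0
gcloud container node-pools create batch-pool \
    --cluster $CLUSTER_NAME \
    --machine-type n1-standard-4 \
    --enable-autoscaling \
    --min-nodes 0 --max-nodes 1 \
    --num-nodes 1 \
    --zone $CLUSTER_ZONE
You would then steer the CronJob's pod onto the big pool, e.g. with a nodeSelector on the cloud.google.com/gke-nodepool: batch-pool label, so only that pool scales up for it.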
Initial suggestion:
Perhaps you could run a micro VM, with cron to scale the cluster up, submit a Job (instead of CronJob), wait for it to finish and then scale it back down to 0?
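Something along these lines, assuming a manifest hello-job.yaml that creates a Job named hello and a node pool named batch-pool (all of these names are made up for the sketch):
#!/bin/sh
# Run by cron on the small VM: scale up, run the Job, wait, then scale back down
gcloud container clusters resize "$CLUSTER_NAME" \
    --node-pool batch-pool --num-nodes 1 --zone "$CLUSTER_ZONE" --quiet
kubectl apply -f hello-job.yaml
kubectl wait --for=condition=complete job/hello --timeout=30m
kubectl delete -f hello-job.yaml
gcloud container clusters resize "$CLUSTER_NAME" \
    --node-pool batch-pool --num-nodes 0 --zone "$CLUSTER_ZONE" --quiet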