Best way for running long Python scripts on GCP

Question

We are starting a new project in our company where we basically run few Python scripts for each client, twice a day. So the idea is, twice a day a Cloud Function will be triggered where the function will trigger the Python script for each client creating new instances of App Engine / Cloud Run or any other serverless service Google's offer.

At the begining we though of using Cloud Functions, but very quickly we found out they are not suited for long running Python scripts, the scripts will eventually calculate and collect different information for each client and write them to Firebase.

The flow of the processes would be: Cloud Function triggered -> function trigger GCP instance for each client -> script running for each client -> out put is being saved to Firebase.

What would be the recommended way to do it without a dedicated server, which GCP serverless services would fit the most?

guillaume blaquiere · Accepted Answer

There is a lot of great answers! The key here is to decouple and to distribute the processing.

When you talk about decoupling you can use Cloud Task (where you can add flow control with rate limit or to postpone a task in the future) or PubSub (more simple message queueing solution).

And Cloud Run is a requirement to run up to 15 minutes processing. But you will have to fine tune it (see below my tips)

So, to summarize the process

You have to trigger a Cloud Functions twice a day. You can use Cloud Scheduler for that.
The triggered Cloud Functions get the list of clients (in database?) and for each client, create a task on Cloud Task(or a message in PubSub)
Each task (or message) call a HTTP endpoint on Cloud Run that perform the process for each client. Set the timeout to 30 minutes on Cloud Run.

However, if your processing is compute intensive, you have to tune Cloud Run. If the processing take 15 minutes for 1 client on 1vCPU, that mean you can't process more than 1 client per CPU if you don't want to reach the timeout (2 clients can lead you to take about 30 minutes for both on the same CPU and you can reach the timeout). For that, I recommend you to set the concurrency parameter of Cloud Run to 1, to process only one request at a time (of course, if you set 2 or 4 CPU on Cloud Run you can also increase the concurrency parameter to 2 or 4 to allow parallel processing on the same instance, but on different CPU).

If the processing is not CPU intensive (you perform API call and you wait the answer) it's harder to say. Try with a concurrency of 5, 10, 30,... and observe the behaviour/latency of the processed requests. No worries, with Cloud Task and PubSUb you can set retry policies in case of timeout.

Last things: is your processing idempotent? I mean, if you run 2 time the same process for the same client, is the result correct or is it a problem? Try to make the solution idempotent to overcome retry issues and globally issues that can happen on distributed computing (including the replays)

Robert G · Answer

@NoCommandLine's answer is a best recommendation and Cloud Run is also a good option if you want to set longer running operations as timeout could be set between 5 minutes (as default) and 60 minutes. You can set or update request timeout through either Cloud Console, command line or YAML.

Meanwhile, execution time for Cloud Function only has 1 minute (by default) and could be set to 9 minutes maximum.

You can check out the full documentation below:

Requesting Timeout for Cloud Run
Requesting Timeout for Cloud Function

You can also check a related SO question through this link.

Best way for running long Python scripts on GCP

Tags:

google-app-engine

google-cloud-platform

google-cloud-functions

ia244

2 Answers

guillaume blaquiere

Robert G

Recent Activity

Donate For Us

Best way for running long Python scripts on GCP

Tags:

google-app-engine

google-cloud-platform

google-cloud-functions

ia244

2 Answers

guillaume blaquiere

Robert G

Related questions

Recent Activity

Donate For Us