Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Will Google Cloud Run support GPU/TPU some day?

So far Google Cloud Run support CPU. Is there any plan to support GPU? It would be super cool if GPU available, then I can demo the DL project without really running a super expensive GPU instance.

like image 738
lixinso Avatar asked May 26 '19 16:05

lixinso


People also ask

Does cloud run support GPU?

GPU units are not yet supported for managed Cloud Run containers, as shown on this comparison chart. Cloud Run for Anthos, on the other hand, does support GPUs. This documentation details how to use NVIDIA GPUs on your instance of Anthos Cloud Run.

Does Google Cloud have GPU?

All the benefits of Google CloudRun GPU workloads on Google Cloud Platform where you have access to industry-leading storage, networking, and data analytics technologies.

What GPUs does Google cloud use?

GPU-Accelerated Containers from NGC NGC provides simple access to pre-integrated and GPU-optimized containers for deep learning software, HPC applications, and HPC visualization tools that take full advantage of NVIDIA A100, V100, P100 and T4 GPUs on Google Cloud.


2 Answers

You can use GPU with Cloud Run for Anthos

https://cloud.google.com/anthos/run/docs/configuring/compute-power-gpu

like image 193
Reza Avatar answered Sep 29 '22 11:09

Reza


So far Google Cloud Run support CPU. Is there any plan to support GPU? It would be super cool if GPU available, then I can demo the DL project without really running a super expensive GPU instance.

I seriously doubt it. GPU/TPUs are specialized hardware. Cloud Run is a managed container service that:

  1. Enables you to run stateless containers that are invokable via HTTP requests. This means that CPU intensive applications are not supported. Inbetween HTTP request/response the CPU is idled to near zero. Your expensive GPU/TPUs would sit idle.
  2. Autoscales based upon the number of requests per second. Launching 10,000 instances in seconds is easy to achieve. Imagine the billing support nightmare for Google if customers could launch that many GPU/TPUs and the size of the bills.
  3. Is billed in 100 ms time intervals. Most requests fit into a few hundred milliseconds of execution. This is not a good execution or business model for CPU/GPU/TPU integration.
  4. Provides a billing model which significantly reduces the cost of web services to near zero when not in use. You just pay for the costs to store your container images. When an HTTP request is received at the service URL, the container image is loaded into an execution unit and processing requests resume. Once requests stop, billing and resource usage also stop.

GPU/TPU types of data processing are best delivered by backend instances that protect and manage the processing power and costs that these processor devices provide.

like image 29
John Hanley Avatar answered Sep 29 '22 11:09

John Hanley