Is it possible to build an `nvidia/cuda`-based image on a server without a GPU?

Tags: docker, continuous-integration, gpu, nvidia-docker

I have a `Dockerfile` based on `nvidia/cuda`, like so:

    FROM nvidia/cuda:11.0-base
    ...

I want to build this `Dockerfile` on our CI server, which does not have an NVIDIA GPU. When I try, I get this error:

    ------
     > [1/6] FROM docker.io/nvidia/cuda:11.0-base:
    ------
    failed to solve with frontend dockerfile.v0: failed to solve with frontend gateway.v0: rpc error: code = Unknown desc = failed to build LLB: failed to load cache key: docker.io/nvidia/cuda:11.0-base not found

The error says that the image was not found, but I think this is misleading: I've been able to isolate the problem to whether or not a GPU is present.

When I build this `Dockerfile` on a server with an NVIDIA GPU, I don't get this error. Is it possible to build a `Dockerfile` based on an `nvidia/cuda` image on a server without a GPU? This would save costs on our CI server.

I plan to deploy the resulting Docker container on a server that does have a GPU. In other words, is it possible to defer the requirement for a GPU from build time to run time?
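One way to check whether the "not found" error really depends on the GPU is to pull the base image by itself on both machines, before any build step runs. The sketch below wraps `docker pull` in a small helper. The function name `base_image_status` and the `DOCKER` override variable are made up for illustration (Docker itself does not read `DOCKER`); the override only exists so the helper can be tried without a Docker daemon.

```shell
#!/bin/sh
# Sketch: report whether a base image can be pulled on this machine.
# DOCKER is an illustrative override, not a variable Docker reads.

base_image_status() {
    image="$1"
    docker_cmd="${DOCKER:-docker}"
    # Discard pull output; only the exit status matters here.
    if "$docker_cmd" pull "$image" >/dev/null 2>&1; then
        echo "pullable"
    else
        echo "not pullable"
    fi
}

# Example (needs Docker and registry access):
#   base_image_status nvidia/cuda:11.0-base
```

If the helper reports "not pullable" on the CI server and "pullable" on the GPU server, the failure happens while resolving the tag rather than in the later build steps, which narrows down where to look.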

asked Aug 07 '20 21:08

Mario Ishac

1 Answer

It sounds like you may need to load the NVIDIA components, possibly including proprietary blobs and kernel modules. If the modules are not present, that could explain the build error (missing dependencies).

However, according to NVIDIA's installation notes at https://docs.nvidia.com/datacenter/tesla/tesla-installation-notes/index.html, the drivers appear to look for the hardware when they load, which is probably why they are not available when you attempt to build.
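If the driver components only become usable once the hardware is present, one option is to leave every GPU check out of the `Dockerfile` and do it when the container starts. The following is a minimal sketch of such an entrypoint helper, not a definitive implementation. It assumes `nvidia-smi` is typically exposed inside the container when a GPU is passed through at run time. The names `gpu_available`, `select_device`, `GPU_PROBE`, and `APP_DEVICE` are hypothetical and not part of Docker or any NVIDIA tool.

```shell
#!/bin/sh
# Hypothetical entrypoint helper: decide at container start, not at build
# time, whether a GPU is usable.

# GPU_PROBE is an illustrative override for the probe command.
gpu_available() {
    probe="${GPU_PROBE:-nvidia-smi}"
    # The probe must exist on PATH and exit successfully.
    command -v "$probe" >/dev/null 2>&1 && "$probe" >/dev/null 2>&1
}

select_device() {
    if gpu_available; then
        echo "gpu"
    else
        echo "cpu"
    fi
}

# APP_DEVICE is a made-up variable an application could read at start-up.
APP_DEVICE="$(select_device)"
export APP_DEVICE
```

A real entrypoint would typically end with `exec "$@"` so that the container's command runs with `APP_DEVICE` set. Because the probe only runs at start-up, nothing in this script needs a GPU while the image is being built.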

answered Oct 04 '22 00:10

Hmbl Stdnt
