How can I prevent a Dockerfile instruction from being cached?

Tags:

In my Dockerfile I use curl or ADD to download the latest version of an archive like:

FROM debian:jessie
...
RUN apt-get install -y curl
...
RUN curl -sL http://example.com/latest/archive.tar.gz --output archive.tar.gz
...
ADD http://example.com/latest/archive2.tar.gz
...

The RUN statement that uses curl or ADD creates its own image layer. That will be used as a cache for future executions of docker build.

Question: How can I disable caching for that instructions?

It would be great to get something like cache invalidation working there. E.g. by using HTTP ETags or by querying the last modified header field. That would give the possibility to do a quick check based on the HTTP headers to decide whether a cached layer could be used or not.

I know that some dirty tricks could help e.g. executing a download shell script in the RUN statement instead. Its filename will be changed before the docker build is triggered by our build system. And I could do the HTTP checks inside that script. But then I need to store either the last used ETag or the last modified to a file somewhere. I am wondering whether there is some more clean and native Docker functionality that I could use, here.

407

asked Aug 03 '15 08:08

Henrik Sachse

2 Answers

A build-time argument can be specified to forcibly break the cache from that step onwards. For example, in your Dockerfile, put

ARG CACHE_DATE=not_a_date

and then give this argument a fresh value on every new build. The best, of course, is the timestamp.

docker build --build-arg CACHE_DATE=$(date +%Y-%m-%d:%H:%M:%S) ...

Make sure the value is a string without any spaces, otherwise docker client will falsely take it as multiple arguments.

See a detailed discussion on Issue 22832.

answered Oct 10 '22 19:10

Ruifeng Ma

docker build --no-cache would invalidate the cache for all the commands.

Dockerfile ADD command used to have the cache invalidated. Although it has been improved in recent docker version:

Docker is supposed to checksum any file added through ADDand then decide if it should use the cache or not.

So if the file added has changed, the cache should be invalidated for the ADD command.

Issue 1326 mentions other tips:

This worked.

RUN yum -y install firefox #redo

So it looks like Docker will re-run the step (and all the steps below it) if the string I am passing to RUN command changes in anyway - even it's just a comment.

The docker cache is used only, and only if none of his ancestor has changed (this behavior makes sense, as the next command will add change to the previous layer).

The cache is used if there isn't any character which has changed (so even a space is enough to invalidate a cache).

answered Oct 10 '22 18:10

VonC

Related questions
                            
                                Manually clear ASP.NET server cache for a single application/web site?
                            
                                Looking for a very simple Cache example
                            
                                Caching and gzip compression by htaccess
                            
                                React Native - Fetch call cached
                            
                                Performance issue with Volley's DiskBasedCache
                            
                                Chrome caching like a mad browser
                            
                                Cached non CORS response conflicts with new CORS request
                            
                                Asking browsers to cache as aggressively as possible
                            
                                How to tell OkHttpClient to ignore cache and force refresh from server?
                            
                                Get list of Cache Keys in Django
                            
                                Leverage browser caching for 3rd party JS
                            
                                How to cache Django Rest Framework API calls?
                            
                                How does Redis work when RAM starts filling up?
                            
                                Is caching a NSDateformatter application-wide good idea?
                            
                                LRU cache implementation in Javascript
                            
                                A Cache Efficient Matrix Transpose Program?
                            
                                Fastest way to loop through a 2d array?
                            
                                No expires header sent, content cached, how long until browser makes conditional GET request?
                            
                                Website needs force refresh after deploy
                            
                                How do I clear the server cache in asp.net?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How can I prevent a Dockerfile instruction from being cached?

Tags:

docker

curl

caching

dockerfile

Henrik Sachse

People also ask

2 Answers

Ruifeng Ma

VonC

Recent Activity

Donate For Us