I use the following Github Actions workflow for my C project. The workflow finishes in ~40 seconds, but more than half of that time is spent by installing the <code>valgrind</code> package and its dependencies. I believe caching could help me speed up the workflow. I do not mind waiting a couple of extra seconds, but this just seems like a pointless waste of GitHub's resources. <pre class="prettyprint"><code>name: C Workflow on: [push, pull_request] jobs: build: runs-on: ubuntu-latest steps: - uses: actions/checkout@v1 - name: make run: make - name: valgrind run: | sudo apt-get install -y valgrind valgrind -v --leak-check=full --show-leak-kinds=all ./bin </code></pre> Running <code>sudo apt-get install -y valgrind</code> installs the following packages: <ul> <li><code>gdb</code></li> <li><code>gdbserver</code></li> <li><code>libbabeltrace1</code></li> <li><code>libc6-dbg</code></li> <li><code>libipt1</code></li> <li><code>valgrind</code></li> </ul> I know Actions support caching of a specific directory (and there are already several answered SO questions and articles about this), but I am not sure where all the different packages installed by apt end up. I assume <code>/bin/</code> or <code>/usr/bin/</code> are not the only directories affected by installing packages. Is there an elegant way to cache the installed system packages for future workflow runs?

The purpose of this answer is to show how caching can be done with github actions. Not necessarily to show how to cache <code>valgrind</code>, which it does show, but more so to show that not everything can/should be cached, and that the tradeoffs of caching and restoring a cache, vs reinstalling the dependency needs to be taken into account. <hr> You will make use of the <code>actions/cache</code> action to do this. Add it as a step (before you need to use valgrind): <pre class="prettyprint"><code>- name: Cache valgrind uses: actions/cache@v2 id: cache-valgrind with: path: "~/valgrind" key: ${{secrets.VALGRIND_VERSION}} </code></pre> The next step should attempt to install the cached version if any or install from the repositories: <pre class="prettyprint"><code>- name: Install valgrind env: CACHE_HIT: ${{steps.cache-valgrind.outputs.cache-hit}} VALGRIND_VERSION: ${{secrets.VALGRIND_VERSION}} run: | if [[ "$CACHE_HIT" == 'true' ]]; then sudo cp --verbose --force --recursive ~/valgrind/* / else sudo apt-get install --yes valgrind="$VALGRIND_VERSION" mkdir -p ~/valgrind sudo dpkg -L valgrind | while IFS= read -r f; do if test -f $f; then echo $f; fi; done | xargs cp --parents --target-directory ~/valgrind/ fi </code></pre> <h3>Explanation</h3> Set <code>VALGRIND_VERSION</code> secret to be the output of: <pre class="prettyprint"><code>apt-cache policy valgrind | grep -oP '(?<=Candidate:\s)(.+)' </code></pre> this will allow you to invalidate the cache when a new version is released simply by changing the value of the secret. <code>dpkg -L valgrind</code> is used to list all the files installed when using <code>sudo apt-get install valgrind</code>. What we can now do with this command is to copy all the dependencies to our cache folder: <pre class="prettyprint"><code>dpkg -L valgrind | while IFS= read -r f; do if test -f $f; then echo $f; fi; done | xargs cp --parents --target-directory ~/valgrind/ </code></pre> <hr> <h3>Furthermore</h3> In addition to copying all the components of <code>valgrind</code>, it may also be necessary to copy the dependencies (such as <code>libc</code> in this case), but I don't recommend continuing along this path because the dependency chain just grows from there. To be precise, the dependencies needed to copy to finally have an environment suitable for valgrind to run in is as follows: <ul> <li>libc6</li> <li>libgcc1</li> <li>gcc-8-base</li> </ul> To copy all these dependencies, you can use the same syntax as above: <pre class="prettyprint"><code>for dep in libc6 libgcc1 gcc-8-base; do dpkg -L $dep | while IFS= read -r f; do if test -f $f; then echo $f; fi; done | xargs cp --parents --target-directory ~/valgrind/ done </code></pre> Is all this work really worth the trouble when all that is required to install <code>valgrind</code> in the first place is to simply run <code>sudo apt-get install valgrind</code>? If your goal is to speed up the build process, then you also have to take into consideration the amount of time it is taking to restore (downloading, and extracting) the cache vs simply running the command again to install <code>valgrind</code>. <hr> And finally to restore the cache, assuming it is stored at <code>/tmp/valgrind</code>, you can use the command: <pre class="prettyprint"><code>cp --force --recursive /tmp/valgrind/* / </code></pre> Which will basically copy all the files from the cache unto the root partition. In addition to the process above, I also have an example of "caching valgrind" by installing and compiling it from source. The cache is now about 63MB (compressed) in size and one still needs to separately install <code>libc</code> which kind of defeats the purpose. <hr> Note: Another answer to this question proposes what I could consider to be a safer approach to caching dependencies, by using a container which comes with the dependencies pre-installed. The best part is that you can use actions to keep those containers up-to-date. References: <ul> <li>https://askubuntu.com/a/408785</li> <li>https://unix.stackexchange.com/questions/83593/copy-specific-file-type-keeping-the-folder-structure</li> </ul>

You could create a docker image with <code>valgrind</code> preinstalled and run your workflow on that. Create a <code>Dockerfile</code> with something like: <pre class="prettyprint"><code>FROM ubuntu RUN apt-get install -y valgrind </code></pre> Build it and push it to dockerhub: <pre class="prettyprint lang-sh prettyprint-override"><code>docker build -t natiiix/valgrind . docker push natiiix/valgrind </code></pre> Then use something like the following as your workflow: <pre class="prettyprint lang-yaml prettyprint-override"><code>name: C Workflow on: [push, pull_request] jobs: build: container: natiiix/valgrind steps: - uses: actions/checkout@v1 - name: make run: make - name: valgrind run: valgrind -v --leak-check=full --show-leak-kinds=all ./bin </code></pre> Completely untested, but you get the idea.

Caching APT packages in GitHub Actions workflow

Tags:

apt

github-actions

I use the following Github Actions workflow for my C project. The workflow finishes in ~40 seconds, but more than half of that time is spent by installing the valgrind package and its dependencies.

I believe caching could help me speed up the workflow. I do not mind waiting a couple of extra seconds, but this just seems like a pointless waste of GitHub's resources.

name: C Workflow  on: [push, pull_request]  jobs:   build:     runs-on: ubuntu-latest      steps:     - uses: actions/checkout@v1      - name: make       run: make      - name: valgrind       run: |         sudo apt-get install -y valgrind         valgrind -v --leak-check=full --show-leak-kinds=all ./bin

Running sudo apt-get install -y valgrind installs the following packages:

gdb
gdbserver
libbabeltrace1
libc6-dbg
libipt1
valgrind

I know Actions support caching of a specific directory (and there are already several answered SO questions and articles about this), but I am not sure where all the different packages installed by apt end up. I assume /bin/ or /usr/bin/ are not the only directories affected by installing packages.

Is there an elegant way to cache the installed system packages for future workflow runs?

421

asked Dec 10 '19 14:12

natiiix

2 Answers

The purpose of this answer is to show how caching can be done with github actions. Not necessarily to show how to cache valgrind, which it does show, but more so to show that not everything can/should be cached, and that the tradeoffs of caching and restoring a cache, vs reinstalling the dependency needs to be taken into account.

You will make use of the actions/cache action to do this.

Add it as a step (before you need to use valgrind):

- name: Cache valgrind   uses: actions/cache@v2   id: cache-valgrind   with:       path: "~/valgrind"       key: ${{secrets.VALGRIND_VERSION}}

The next step should attempt to install the cached version if any or install from the repositories:

- name: Install valgrind   env:     CACHE_HIT: ${{steps.cache-valgrind.outputs.cache-hit}}     VALGRIND_VERSION: ${{secrets.VALGRIND_VERSION}}   run: |       if [[ "$CACHE_HIT" == 'true' ]]; then         sudo cp --verbose --force --recursive ~/valgrind/* /       else         sudo apt-get install --yes valgrind="$VALGRIND_VERSION"         mkdir -p ~/valgrind         sudo dpkg -L valgrind | while IFS= read -r f; do if test -f $f; then echo $f; fi; done | xargs cp --parents --target-directory ~/valgrind/       fi

Explanation

Set VALGRIND_VERSION secret to be the output of:

apt-cache policy valgrind | grep -oP '(?<=Candidate:\s)(.+)'

this will allow you to invalidate the cache when a new version is released simply by changing the value of the secret.

dpkg -L valgrind is used to list all the files installed when using sudo apt-get install valgrind.

What we can now do with this command is to copy all the dependencies to our cache folder:

dpkg -L valgrind | while IFS= read -r f; do if test -f $f; then echo $f; fi; done | xargs cp --parents --target-directory ~/valgrind/

Furthermore

In addition to copying all the components of valgrind, it may also be necessary to copy the dependencies (such as libc in this case), but I don't recommend continuing along this path because the dependency chain just grows from there. To be precise, the dependencies needed to copy to finally have an environment suitable for valgrind to run in is as follows:

libc6
libgcc1
gcc-8-base

To copy all these dependencies, you can use the same syntax as above:

for dep in libc6 libgcc1 gcc-8-base; do     dpkg -L $dep | while IFS= read -r f; do if test -f $f; then echo $f; fi; done | xargs cp --parents --target-directory ~/valgrind/ done

Is all this work really worth the trouble when all that is required to install valgrind in the first place is to simply run sudo apt-get install valgrind? If your goal is to speed up the build process, then you also have to take into consideration the amount of time it is taking to restore (downloading, and extracting) the cache vs simply running the command again to install valgrind.

And finally to restore the cache, assuming it is stored at /tmp/valgrind, you can use the command:

cp --force --recursive /tmp/valgrind/* /

Which will basically copy all the files from the cache unto the root partition.

In addition to the process above, I also have an example of "caching valgrind" by installing and compiling it from source. The cache is now about 63MB (compressed) in size and one still needs to separately install libc which kind of defeats the purpose.

Note: Another answer to this question proposes what I could consider to be a safer approach to caching dependencies, by using a container which comes with the dependencies pre-installed. The best part is that you can use actions to keep those containers up-to-date.

References:

https://askubuntu.com/a/408785
https://unix.stackexchange.com/questions/83593/copy-specific-file-type-keeping-the-folder-structure

134

answered Sep 18 '22 14:09

smac89

You could create a docker image with valgrind preinstalled and run your workflow on that.

Create a Dockerfile with something like:

FROM ubuntu  RUN apt-get install -y valgrind

Build it and push it to dockerhub:

docker build -t natiiix/valgrind . docker push natiiix/valgrind

Then use something like the following as your workflow:

name: C Workflow  on: [push, pull_request]  jobs:   build:     container: natiiix/valgrind      steps:     - uses: actions/checkout@v1      - name: make       run: make      - name: valgrind       run: valgrind -v --leak-check=full --show-leak-kinds=all ./bin

Completely untested, but you get the idea.

answered Sep 21 '22 14:09

deivid

Related questions
                            
                                Handling input confirmations in Linux shell scripting
                            
                                Clone Debian/Ubuntu installation [closed]
                            
                                Why do I see Failed to fetch error while dping apt-get update
                            
                                W: Failed to fetch http://deb.debian.org/debian/dists/jessie-updates/main/binary-amd64/Packages 404 Not Found [IP: 151.101.140.204 80]
                            
                                List directory /var/lib/apt/lists/partial is missing. - Acquire (20: Not a directory)
                            
                                How do I preserve installed applications when migrating Ubuntu to another platform?
                            
                                Java annotation processing with source code manipulation
                            
                                APT (Annotation Processing Tool)
                            
                                how to install debian package from unsigned repository
                            
                                Google Compute Engine Ubuntu 17.04 zesty does no longer have a Release file
                            
                                Can't remove, purge, unistall mongodb from debian
                            
                                apt-get install via tunnel proxy but ssh only from client side
                            
                                How to remove a snap application (docker) completely
                            
                                How to get parameter type from javax.lang.model.VariableElement
                            
                                Dockerfile: Benefits of repeated apt cache cleans
                            
                                dpkg error: pycompile: not found
                            
                                Java 6 annotation processing configuration with Ant
                            
                                Could not select 'OK' in mysql-apt-config [Ubuntu 14.04]
                            
                                How to make Debian package install dependencies?
                            
                                Stuck with apt --fix-broken install (libc6:amd64 package post-installation)

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With