I'm writing a custom backup script in bash for personal use. The goal is to compress the contents of a directory via tar/gzip, split the compressed archive, then upload the parts to AWS S3.
On my first try writing this script a few months ago, I was able to get it working via something like:
tar -czf - /mnt/STORAGE_0/dir_to_backup | split -b 100M -d -a 4 - /mnt/SCRATCH/backup.tgz.part
aws s3 sync /mnt/SCRATCH/ s3://backups/ --delete
rm /mnt/SCRATCH/*
This worked well for my purposes, but required /mnt/SCRATCH to have enough disk space to store the compressed archive. Now I want to improve this script so it doesn't rely on having enough space in /mnt/SCRATCH. After some research, I ended up with something like:
tar -czf - /mnt/STORAGE_0/dir_to_backup | split -b 100M -d -a 4 --filter "aws s3 cp - s3://backups/backup.tgz.part" -
This almost works, but the target filename on my S3 bucket is not dynamic, and it seems to just overwrite the backup.tgz.part file several times while running. The end result is just one 100MB file, instead of the intended several 100MB files with endings like .part0001.
Any guidance would be much appreciated. Thanks!
When using split you can use the environment variable $FILE to get the generated file name. See the split man page:

--filter=COMMAND
       write to shell COMMAND; file name is $FILE
For your use case you could use something like the following:

--filter 'aws s3 cp - s3://backups/backup.tgz.part$FILE'

(the single quotes are needed; otherwise your shell would expand $FILE immediately, before split ever runs, instead of leaving it for the filter's shell to expand once per chunk)
This will generate the following file names on S3 (the x comes from split's default output prefix, since no prefix argument is passed to split):

backup.tgz.partx0000
backup.tgz.partx0001
backup.tgz.partx0002
...
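If you want to sanity-check that naming locally before anything is uploaded, you can swap the aws call for a throwaway filter. A minimal sketch, assuming GNU split; seq just supplies a few MB of dummy input and the echo/wc pair stands in for the upload command:

seq 1 2000000 | split -b 1M -d -a 4 --filter 'echo "would upload backup.tgz.part$FILE ($(wc -c) bytes)"' -

Each chunk prints a line like "would upload backup.tgz.partx0000 (1048576 bytes)", confirming that $FILE changes for every chunk.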
Full example:
tar -czf - /mnt/STORAGE_0/dir_to_backup | split -b 100M -d -a 4 --filter 'aws s3 cp - s3://backups/backup.tgz.part$FILE' -
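Since $FILE is the full output name split would have generated, prefix included, another option (a variation that follows directly from the man page, not something you must do) is to hand the prefix to split itself and keep the S3 URL generic, which avoids the default x entirely:

tar -czf - /mnt/STORAGE_0/dir_to_backup | split -b 100M -d -a 4 --filter 'aws s3 cp - s3://backups/$FILE' - backup.tgz.part

That should produce objects named backup.tgz.part0000, backup.tgz.part0001, and so on, matching the naming you originally intended.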
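For completeness, restoring is the reverse: stream the parts back in order and feed them to tar. A rough sketch, assuming the backup.tgz.part naming above and a hypothetical /restore/target directory; the zero-padded suffixes mean a plain sort already gives the right order:

for part in $(aws s3 ls s3://backups/ | awk '{print $4}' | grep '^backup\.tgz\.part' | sort); do
    # stream each part straight from S3 to stdout, in suffix order
    aws s3 cp "s3://backups/$part" -
done | tar -xzf - -C /restore/target

This keeps the no-scratch-space property of the backup side, since each part is piped straight from S3 into tar.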