How can I extract the size of the total uncompressed file data in a .tar.gz file from command line?

This works for any file size: <pre class="prettyprint"><code>zcat archive.tar.gz | wc -c </code></pre> For files smaller than 4Gb you could also use the -l option with gzip: <pre class="prettyprint"><code>$ gzip -l compressed.tar.gz compressed uncompressed ratio uncompressed_name 132 10240 99.1% compressed.tar </code></pre>

This will sum the total content size of the extracted files: <pre class="prettyprint"><code>$ tar tzvf archive.tar.gz | sed 's/ \+/ /g' | cut -f3 -d' ' | sed '2,$s/^/+ /' | paste -sd' ' | bc </code></pre> The output is given in bytes. Explanation: <code>tar tzvf</code> lists the files in the archive in verbose format like <code>ls -l</code>. <code>sed</code> and <code>cut</code> isolate the file size field. The second <code>sed</code> puts a + in front of every size except the first and <code>paste</code> concatenates them, giving a sum expression that is then evaluated by <code>bc</code>. Note that this doesn't include metadata, so the disk space taken up by the files when you extract them is going to be larger - potentially many times larger if you have a lot of very small files.

Check the total content size of a tar gz file

2 Answers

This works for any file size:

zcat archive.tar.gz | wc -c

For files smaller than 4Gb you could also use the -l option with gzip:

$ gzip -l compressed.tar.gz
     compressed        uncompressed  ratio uncompressed_name
            132               10240  99.1% compressed.tar

150

answered Oct 16 '22 14:10

Matthew Mott

This will sum the total content size of the extracted files:

$ tar tzvf archive.tar.gz | sed 's/ \+/ /g' | cut -f3 -d' ' | sed '2,$s/^/+ /' | paste -sd' ' | bc

The output is given in bytes.

Explanation: tar tzvf lists the files in the archive in verbose format like ls -l. sed and cut isolate the file size field. The second sed puts a + in front of every size except the first and paste concatenates them, giving a sum expression that is then evaluated by bc.

Note that this doesn't include metadata, so the disk space taken up by the files when you extract them is going to be larger - potentially many times larger if you have a lot of very small files.

answered Oct 16 '22 13:10

Ztyx

Related questions
                            
                                GZip every file separately
                            
                                Enable GZIP for CSS and JS files on NGINX server for Magento
                            
                                How to create tar.gz archive file in Windows? [closed]
                            
                                How to enable gzip HTTP compression on Windows Azure dynamic content
                            
                                How can I Zip and Unzip a string using GZIPOutputStream that is compatible with .Net?
                            
                                Decode gzipped web page retrieved via cURL in PHP
                            
                                gzip: stdin: not in gzip format tar: Child returned status 1 tar: Error is not recoverable: exiting now
                            
                                How to check if InputStream is Gzipped?
                            
                                uncompress a .txt.gz file in mac?
                            
                                How do you create a .gz file using PHP?
                            
                                HttpWebRequest & Native GZip Compression
                            
                                GZip Compression On IIS 7.5 is not working
                            
                                zlib.error: Error -3 while decompressing: incorrect header check
                            
                                What 'Content-Type' header to use when serving gzipped files?
                            
                                What is gZip compression?
                            
                                Compression formats with good support for random access within archives?
                            
                                Why do real-world servers prefer gzip over deflate encoding?
                            
                                Import and insert sql.gz file into database with putty
                            
                                Why can't browser send gzip request?
                            
                                How can I get gzip compression in IIS7 working?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Check the total content size of a tar gz file

Tags:

gzip

tar

Ztyx

People also ask

2 Answers

Matthew Mott

Ztyx

Recent Activity

Donate For Us