 

Making sense of Postgres row sizes

I have a large (>100M rows) Postgres table with the structure {integer, integer, integer, timestamp without time zone}. I expected the size of a row to be 3*integer + 1*timestamp = 3*4 + 1*8 = 20 bytes.

In reality the row size is pg_relation_size(tbl) / count(*) = 52 bytes. Why?

(No deletes are done against the table: pg_relation_size(tbl, 'fsm') ~= 0)
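For reference, a minimal sketch of that measurement, assuming a placeholder table name tbl:

    -- Average bytes per row: size of the table's main fork
    -- divided by the row count.
    SELECT pg_relation_size('tbl') / count(*) AS bytes_per_row
    FROM tbl;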

Asked Nov 26 '12 by Arman

People also ask

How big is too big for a PostgreSQL table?

PostgreSQL normally stores its table data in chunks of 8KB. The number of these blocks is limited to a 32-bit unsigned integer (just over four billion), giving a maximum table size of 32TB.
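That arithmetic can be sanity-checked against the server's actual page size (SHOW block_size is standard PostgreSQL; the 32TB figure assumes a default build):

    -- Report the page size the limit is based on (8192 bytes by default).
    SHOW block_size;
    -- 2^32 blocks * 8192 bytes per block = 32 TB maximum relation size.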

Does number of columns affect performance in Postgres?

Yes, the number of columns will indirectly influence performance. The data in the columns will also affect the speed.

How many columns should a Postgres table have?

There is a limit on how many columns a table can contain. Depending on the column types, it is between 250 and 1600.

How to calculate the row size of a Postgres table?

For example, a table has 50 rows and a total size of 10GB, but you want to know the size of each row rather than the size of the whole table. A function to calculate the row size of a Postgres table looks something like this: SELECT sum(pg_column_size(t.*)) AS filesize, count(*) AS filerow FROM TABLE_NAME AS t;
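A runnable version of that query, assuming a hypothetical table named tbl (dividing the two outputs gives an average per-row data size, excluding page-level overhead):

    -- Total tuple-data bytes, row count, and their ratio.
    SELECT sum(pg_column_size(t.*))            AS total_bytes,
           count(*)                            AS row_count,
           sum(pg_column_size(t.*)) / count(*) AS avg_bytes_per_row
    FROM tbl AS t;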

What are the limits of PostgreSQL?

Appendix K. PostgreSQL Limits (excerpt):

  • database size: unlimited

  • number of databases: 4,294,950,911

  • relations per database: 1,431,650,303

  • relation size: 32 TB with the default BLCKSZ of 8192 bytes

(7 more rows ...)

How does PostgreSQL store large chunks of data?

PostgreSQL uses a fixed page size (commonly 8 kB) and does not allow tuples to span multiple pages, so it is not possible to store very large field values directly. Instead, large field values are compressed and/or broken up into multiple physical rows in an associated TOAST table.
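A small sketch of that behavior, using a made-up throwaway table: pg_column_size reports the stored (possibly compressed) size, while octet_length reports the raw size of the value.

    -- A highly compressible 100 kB string is stored in far fewer bytes.
    CREATE TEMP TABLE toast_demo (payload text);
    INSERT INTO toast_demo SELECT repeat('x', 100000);
    SELECT pg_column_size(payload) AS stored_bytes,
           octet_length(payload)  AS raw_bytes
    FROM toast_demo;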

What are the disadvantages of using medium-size text in PostgreSQL?

The main problem with medium-size texts is that they make the rows very wide. This is a problem because PostgreSQL, like other OLTP-oriented databases, stores values in rows. When we ask the database to execute a query that touches only a few columns, the values of those columns are most likely spread across many blocks.


2 Answers

Calculation of row size is much more complex than that.

Storage is typically partitioned in 8 kB data pages. There is a small fixed overhead per page, possible remainders not big enough to fit another tuple, and more importantly dead rows or a percentage initially reserved with the FILLFACTOR setting.

And there is even more overhead per row (tuple): an item identifier of 4 bytes at the start of the page, the HeapTupleHeader of 23 bytes and alignment padding. The start of the tuple header as well as the start of tuple data are aligned at a multiple of MAXALIGN, which is 8 bytes on a typical 64-bit machine. Some data types require alignment to the next multiple of 2, 4 or 8 bytes.

Quoting the manual on the system table pg_type:

typalign is the alignment required when storing a value of this type. It applies to storage on disk as well as most representations of the value inside PostgreSQL. When multiple values are stored consecutively, such as in the representation of a complete row on disk, padding is inserted before a datum of this type so that it begins on the specified boundary. The alignment reference is the beginning of the first datum in the sequence.

Possible values are:

  • c = char alignment, i.e., no alignment needed.

  • s = short alignment (2 bytes on most machines).

  • i = int alignment (4 bytes on most machines).

  • d = double alignment (8 bytes on many machines, but by no means all).

Read about the basics in the Database Page Layout chapter of the manual.
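Those alignment values can be inspected directly in the catalog; for the types in the question, a query like this shows why the timestamp forces 8-byte alignment:

    -- int4 aligns on 4 bytes ('i'); timestamp aligns on 8 bytes ('d').
    SELECT typname, typlen, typalign
    FROM   pg_type
    WHERE  typname IN ('int4', 'timestamp');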

Your example

This results in 4 bytes of padding after your 3 integer columns, because the timestamp column requires double alignment and needs to start at the next multiple of 8 bytes.

So, one row occupies:

      23   -- heap tuple header
    +  1   -- padding or NULL bitmap
    + 12   -- 3 * integer (no alignment padding here)
    +  4   -- padding after 3rd integer
    +  8   -- timestamp
    +  0   -- no padding, since the tuple ends at a multiple of MAXALIGN

Plus item identifier per tuple in the page header (as pointed out by @A.H. in the comment):

    +  4   -- item identifier in page header
    ------
      52 bytes

So we arrive at the observed 52 bytes.
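That figure can be reproduced with a throwaway table that has the same column layout as the question's (a sketch; the result may vary slightly with page-level overhead):

    -- Build a table with the question's structure and enough rows
    -- that per-page overhead is amortized.
    CREATE TEMP TABLE row_size_demo (a int, b int, c int, d timestamp);
    INSERT INTO row_size_demo
    SELECT g, g, g, now()::timestamp
    FROM   generate_series(1, 100000) g;

    -- Should come out at roughly 52 bytes per row.
    SELECT pg_relation_size('row_size_demo') / count(*) AS bytes_per_row
    FROM row_size_demo;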

The calculation pg_relation_size(tbl) / count(*) is a pessimistic estimation. pg_relation_size(tbl) includes bloat (dead rows) and space reserved by fillfactor, as well as overhead per data page and per table. (And we didn't even mention compression for long varlena data in TOAST tables, since it doesn't apply here.)

You can install the additional module pgstattuple and call SELECT * FROM pgstattuple('tbl_name'); for more information on table and tuple size.
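For instance, a sketch of such a call (tuple_len / tuple_count gives the average live tuple size, which excludes the 4-byte item identifiers):

    -- pgstattuple ships as a contrib module.
    CREATE EXTENSION IF NOT EXISTS pgstattuple;

    SELECT tuple_len / tuple_count AS avg_tuple_bytes,
           dead_tuple_count,
           free_space
    FROM pgstattuple('tbl_name');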

Related:

  • Table size with page layout
  • Calculating and saving space in PostgreSQL
Answered Sep 21 '22 by Erwin Brandstetter


Each row has metadata associated with it. The correct formula is (assuming naïve alignment):

    3 * 4 + 1 * 8 == your data (20 bytes)
         24 bytes == row overhead (23-byte heap tuple header, MAXALIGN'ed to 24)
    total size per row: 24 + 20

Or roughly 44 bytes; alignment padding within the tuple and the 4-byte item identifier in the page header bring it to the observed 52. I actually wrote postgresql-varint specifically to help with this exact use case. You may want to look at a similar post for additional details re: tuple overhead.

Answered Sep 23 '22 by Sean