Why does PostgreSQL (TimescaleDB) use more storage for a table than expected?

I'm new to databases. I recently started using TimescaleDB, which is a PostgreSQL extension, so I guess this question is PostgreSQL-related as well.

I observed some strange behavior. Based on my table structure (1 timestamp, 2 doubles), each row should take 24 bytes. I imported 2,750,182 rows from a CSV file (via psycopg2's copy_from), so by my manual calculation the table should be about 63 MB. But when I query TimescaleDB, it tells me the table size is 137 MB, the index size is 100 MB, and the total is 237 MB. I expected the table size to match my calculation, but it doesn't. Any idea why?
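For reference, a quick way to see where the space goes is to ask Postgres for the heap and index sizes separately. Below is a minimal sketch with psycopg2; the connection string and the table name `readings` are placeholders for whatever was actually used, and note that for a TimescaleDB hypertable the rows live in chunk tables, so TimescaleDB's own size functions (which vary by version) may be more accurate than the plain Postgres ones shown here:

    import psycopg2

    # Placeholder DSN and table name -- substitute your own.
    conn = psycopg2.connect("dbname=mydb user=postgres")
    cur = conn.cursor()
    table = "readings"

    cur.execute(
        """
        SELECT pg_size_pretty(pg_relation_size(%s::regclass)),       -- heap (row data) only
               pg_size_pretty(pg_indexes_size(%s::regclass)),        -- all indexes on the table
               pg_size_pretty(pg_total_relation_size(%s::regclass))  -- heap + indexes + TOAST
        """,
        (table, table, table),
    )
    heap, indexes, total = cur.fetchone()
    print(f"heap={heap}  indexes={indexes}  total={total}")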

asked Nov 23 '17 by Xiang Zhang


1 Answer

There are two basic reasons your table is bigger than you expect:

  1. Per-tuple overhead: An answer to a related question goes into detail that I won't repeat here, but basically Postgres uses 23 (+padding) bytes per row for various internal bookkeeping, mostly multi-version concurrency control (MVCC) management (Bruce Momjian has some good intros if you want more info). That overhead gets you pretty darn close to the 137 MB you are seeing; see the worked calculation after this list. Any remaining difference is likely due to the table's fillfactor setting or to dead rows still present in the table from, say, a previous insert and subsequent delete.
  2. Index size: Unlike some other DBMSs, Postgres does not organize its tables on disk around an index unless you manually cluster the table on one, and even then it will not maintain the clustering over time (see https://www.postgresql.org/docs/10/static/sql-cluster.html). Instead it keeps its indexes separately, which is why there is extra space for your index. If on-disk size really matters to you and you aren't using the index for, say, enforcing a uniqueness constraint, you might consider a BRIN index, especially if your data is inserted in roughly time order (see https://www.postgresql.org/docs/10/static/brin-intro.html); there is a short sketch below.
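To make the per-tuple overhead from point 1 concrete, here is a back-of-the-envelope check. This is a sketch: the 8 kB page, the 24-byte page header, the 4-byte line pointer, and the 23-byte tuple header padded to 24 are standard Postgres heap-layout constants, and the row count comes from the question:

    ROWS = 2750182       # rows imported in the question
    PAGE = 8192          # default Postgres page size
    PAGE_HEADER = 24     # fixed header at the start of each page
    LINE_POINTER = 4     # per-row item pointer stored in the page
    TUPLE_HEADER = 24    # 23-byte tuple header, padded to 8-byte alignment
    DATA = 24            # 1 timestamp + 2 doubles = 3 * 8 bytes

    per_row = TUPLE_HEADER + DATA + LINE_POINTER      # 52 bytes per row
    rows_per_page = (PAGE - PAGE_HEADER) // per_row   # 157 rows fit on a page
    pages = -(-ROWS // rows_per_page)                 # ceiling division -> 17518 pages
    print(pages * PAGE / 1024 / 1024)                 # ~136.9 MB, matching the observed 137 MB

So the 137 MB heap is almost exactly what Postgres' row and page overhead predicts for 2.75 million rows of 24-byte payload.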
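And here is a sketch of the BRIN suggestion from point 2, again with placeholder names (a `readings` table with a timestamp column `ts`). BRIN stores one summary entry per range of pages rather than one entry per row, so it only helps when on-disk order correlates with the indexed column, which is typical for append-only time-series loads; on a hypertable, TimescaleDB should propagate the index definition to each chunk:

    import psycopg2

    conn = psycopg2.connect("dbname=mydb user=postgres")  # placeholder DSN
    cur = conn.cursor()

    # Hypothetical table 'readings' with timestamp column 'ts'.
    cur.execute("CREATE INDEX IF NOT EXISTS readings_ts_brin ON readings USING brin (ts)")
    conn.commit()

    cur.execute("SELECT pg_size_pretty(pg_relation_size('readings_ts_brin'::regclass))")
    print(cur.fetchone()[0])  # typically kilobytes, versus the ~100 MB btree in the question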
answered Sep 19 '22 by davidk