Shared hit cache in postgreSQL

Tags:

I'm experimenting with the EXPLAIN command and trying to find out what the shared hit is.

Seq Scan on foo  (cost=0.00..18334.00 rows=1000000 width=37) (actual time=0.030..90.500 rows=1000000 loops=1)
  Buffers: shared hit=512 read=7822
Total runtime: 116.080 ms

I've noticed that the more shared hit number we have the faster a query is being executed. But what is that? As far as I got, shared read is just reading from a physical storage like RAID or SSD. But why is the shared hit faster? Is it stored inside RAM or where?

688

asked Oct 02 '15 12:10

St.Antario

1 Answers

shared hit essentially means the value has already been cached in the main memory of the computer and it was not necessary to read this from the hard disk.

Accessing the main memory (RAM) is much faster than reading values from the hard disk. And that's why the query is faster the more share hits it has.

Immediately after starting Postgres, none of the data is available in the main memory (RAM) and everything needs to be read from the hard disk.

Consider this step from an execution plan:

  ->  Seq Scan on products.product_price  (cost=0.00..3210.27 rows=392273 width=0) (actual time=0.053..103.958 rows=392273 loops=1)
        Output: product_id, valid_from, valid_to, price
        Buffers: shared read=2818
        I/O Timings: read=48.382

The part "Buffers: shared read=2818" means that 2818 blocks (each 8k) had to be read from the hard disk (and that took 48ms - I have a SSD). Those 2818 blocks were stored in the cache (the "shared buffers") so that the next time they are needed the database does not need to read them (again) from the (slow) hard disk.

When I re-run that statement the plan changes to:

  ->  Seq Scan on products.product_price  (cost=0.00..3210.27 rows=392273 width=0) (actual time=0.012..45.690 rows=392273 loops=1)
        Output: product_id, valid_from, valid_to, price
        Buffers: shared hit=2818

Which means that those 2818 blocks that the previous statement were still in the main memory (=RAM) and Postgres did not need to read them from the hard disk.

"memory" always refers to the main memory (RAM) built into the computer and directly accessible to the CPU - as opposed to "external storage".

There are several presentations on how Postgres manages the shared buffers:

http://de.slideshare.net/EnterpriseDB/insidepostgressharedmemory2015
http://momjian.us/main/writings/pgsql/hw_performance/
https://2ndquadrant.com/media/pdfs/talks/InsideBufferCache.pdf (very technical)
http://raghavt.blogspot.de/2012/04/caching-in-postgresql.html

139

answered Sep 19 '22 11:09

a_horse_with_no_name

Related questions
                            
                                How to use MySQL index columns?
                            
                                About index and primary key in SQL?
                            
                                SQLite long to wide formats?
                            
                                Which jar to use for connecting to MS SQL server
                            
                                Select or boolean aggregate function in PostgreSQL
                            
                                sql server 2008 management studio not checking the syntax of my query
                            
                                How to find which columns don't have any data (all values are NULL)?
                            
                                Is a SQL 'not in' more 'expensive' than a SQL 'in'?
                            
                                users assigned a sql azure role
                            
                                How to convert int to date in SQL Server 2008
                            
                                Nesting Aggregate Functions - SQL
                            
                                How can I check SQL syntax for a JDBC statement without running the actual query?
                            
                                commit after select
                            
                                unknown database in jdbc
                            
                                Is order in a subquery guaranteed to be preserved?
                            
                                Unable to restore bacpac due to foreign key conflict
                            
                                Many tables or rows, which one is more efficient in SQL?
                            
                                How to call mysql function using querydsl?
                            
                                Is LIMIT clause in HIVE really random?
                            
                                How to use variables in SQL raiserror

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Shared hit cache in postgreSQL

Tags:

sql

caching

postgresql

St.Antario

People also ask

1 Answers

a_horse_with_no_name

Recent Activity

Donate For Us