Vacuum analyze all tables in a schema postgres

Tags:

postgresql

I have a very large postgres database that has one particular schema in it which is dropped in and recreated nightly. After all of the tables in that schema are created I want to vacuum analyze them, however the database is so large that if a do a full db VACUUM ANALYZE; it takes about a half hour.

How can I go about vacuum analyzing each of the tables in this schema only without writing a separate SQL command for each table?

277

asked Apr 17 '15 22:04

Grant Humphries

1 Answers

The bash function below utilizes the CLI tool psql to vacuum analyze tables in a single schema which can be identified by either passing the name of the schema as the first parameter to the function or setting the environment variable PG_SCHEMA:

vacuum_analyze_schema() {
    # vacuum analyze only the tables in the specified schema

    # postgres info can be supplied by either passing it as parameters to this
    # function, setting environment variables or a combination of the two
    local pg_schema="${1:-${PG_SCHEMA}}"
    local pg_db="${2:-${PG_DB}}"
    local pg_user="${3:-${PG_USER}}"
    local pg_host="${4:-${PG_HOST}}"

    echo "Vacuuming schema \`${pg_schema}\`:"

    # extract schema table names from psql output and put them in a bash array
    local psql_tbls="\dt ${pg_schema}.*"
    local sed_str="s/${pg_schema}\s+\|\s+(\w+)\s+\|.*/\1/p"
    local table_names=$( echo "${psql_tbls}" | psql -d "${pg_db}" -U "${pg_user}" -h "${pg_host}"  | sed -nr "${sed_str}" )
    local tables_array=( $( echo "${table_names}" | tr '\n' ' ' ) )

    # loop through the table names creating and executing a vacuum
    # command for each one
    for t in "${tables_array[@]}"; do
        echo "doing table \`${t}\`..."
        psql -d "${pg_db}" -U "${pg_user}" -h "${pg_host}" \
            -c "VACUUM (ANALYZE) ${pg_schema}.${t};"
    done
}

This function can be added to your .bashrc to provide the ability to invoke it from the command line at any time. Like the schema, Postgres connection and database values can be set by either supplying them as function parameters:

# params must be in this order
vacuum_analyze_schema '<your-pg-schema>' '<your-pg-db>' '<your-pg-user>' '<your-pg-host>'

or by setting environment variables:

PG_SCHEMA='<your-pg-schema>'
PG_USER='<your-pg-user>'
PG_HOST='<your-pg-host>'
PG_DB='<your-pg-db>'

vacuum_analyze_schema

or by a combination of both. Values passed as params will take precedence over corresponding environment vars.

answered Sep 22 '22 07:09

Grant Humphries

Related questions
                            
                                How to write update function (stored procedure) in Postgresql?
                            
                                PostgreSQL - relation doesnt exist error when granting priviliges
                            
                                Read a Postgresql array directly into a Golang Slice
                            
                                Deleting a table in PostgreSQL without deleting an associated sequence
                            
                                Does not using NULL in PostgreSQL still use a NULL bitmap in the header?
                            
                                Get interval in milliseconds
                            
                                convert rows to string in postgresql
                            
                                Postgres append or set each elements(if not exists) of an array to an array column
                            
                                Update or create nested jsonb value using single update command
                            
                                How to know a timezone of a timestamp in postgresql 8.3
                            
                                What's the difference between B-Tree and GiST index methods (in PostgreSQL)?
                            
                                How to get column attributes query from table name using PostgreSQL?
                            
                                Difference between EXTRACT(year from timestamp) function and date_part('year', timestamp) in PostgreSQL
                            
                                PostgreSQL Trigger Error : control reached end of trigger procedure without RETURN
                            
                                AttributeError: 'UUID' object has no attribute 'replace' when using backend-agnostic GUID type
                            
                                ERROR: function dblink(unknown, unknown) does not exist
                            
                                How to set a nullable database field to NULL with typeorm?
                            
                                Django how to reconnect after DatabaseError: query timeout
                            
                                how to copy data from file to PostgreSQL using JDBC?
                            
                                How do I fix a PostgreSQL 9.3 Slave that Cannot Keep Up with the Master?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Vacuum analyze all tables in a schema postgres

Tags:

postgresql

Grant Humphries

People also ask

1 Answers

Grant Humphries

Recent Activity

Donate For Us