
Amazon Redshift at 100% disk usage due to VACUUM query

Following the Amazon Redshift documentation, I ran a VACUUM on a 400 GB table which had never been vacuumed before, in an attempt to improve query performance. Unfortunately, the VACUUM has caused the table to grow to 1.7 TB (!!) and has brought the cluster's disk usage to 100%. I then tried to stop the VACUUM by running a CANCEL query in the superuser queue (you enter it by running "set query_group='superuser';"), but although the CANCEL didn't raise an error, it had no effect on the vacuum query, which keeps running.

What can I do?
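For reference, the cancel attempt described above looks roughly like this (the pid value 18764 is purely illustrative; use the pid of your own vacuum query, e.g. from STV_RECENTS):

    -- Switch this session into the superuser queue so the CANCEL
    -- is not queued behind the running vacuum (requires a superuser):
    set query_group='superuser';

    -- Cancel the query by its process id (18764 is a made-up example):
    cancel 18764;

    -- Return the session to its normal queue:
    reset query_group;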

asked Jul 16 '14 by Maxim Kogan




2 Answers

Hint: run the query below (taken from here) to see which tables you should vacuum.

Note: this only helps you see which tables are big and what you can gain by vacuuming each one.

select trim(pgdb.datname) as Database,
       trim(a.name) as Table,
       ((b.mbytes/part.total::decimal)*100)::decimal(5,2) as pct_of_total,
       b.mbytes,
       b.unsorted_mbytes
from stv_tbl_perm a
join pg_database as pgdb on pgdb.oid = a.db_id
join (select tbl,
             sum(decode(unsorted, 1, 1, 0)) as unsorted_mbytes,
             count(*) as mbytes
      from stv_blocklist
      group by tbl) b on a.id = b.tbl
join (select sum(capacity) as total
      from stv_partitions
      where part_begin = 0) as part on 1 = 1
where a.slice = 0
order by 3 desc, db_id, name;

Then vacuum the table(s) with high unsorted_mbytes: VACUUM your_table;
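If disk space is tight, as in the question, a plain VACUUM (which defaults to FULL) may not be the best first move. A sketch of the less disk-hungry variants, assuming your_table is the table in question:

    -- Reclaim space from deleted rows without re-sorting:
    VACUUM DELETE ONLY your_table;

    -- Re-sort rows without reclaiming deleted space:
    VACUUM SORT ONLY your_table;

    -- Full vacuum, but skip the sort phase if the table is
    -- already at least 75 percent sorted:
    VACUUM FULL your_table TO 75 PERCENT;

The TO threshold PERCENT clause lets the vacuum stop short of a complete re-sort, which reduces the work (and intermediate space) a full vacuum performs.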

answered Sep 27 '22 by Benjamin Crouzier


I have stopped vacuum operations several times; maybe the feature was not available at the time.
Run the query below, which gives you the process id (pid) of the running vacuum query:

select * from stv_recents where status='Running';

Once you have the process id, you can run the following query to terminate the process:

select pg_terminate_backend( pid );
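Putting the two steps together (the pid value 18764 below is purely illustrative; substitute whatever pid the first query returns for your vacuum):

    -- Find the running vacuum's pid:
    select pid, user_name, starttime, query
    from stv_recents
    where status = 'Running';

    -- Terminate that session by pid (18764 is a made-up example):
    select pg_terminate_backend(18764);

Note that pg_terminate_backend kills the whole session, which is more forceful than CANCEL; it can be the only thing that works when CANCEL has no effect.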

answered Sep 27 '22 by Rahul Gupta