Do Redshift column encodings affect query execution speed?

Question

When creating data tables in Amazon Redshift, you can specify various encodings such as MOSTLY32 or BYTEDICT or LZO. Those are the compressions used when storing the columnar values on disk.

I am wondering if my choice of encoding is supposed to make a difference in query execution times. For example, if I make a column BYTEDICT would that make a difference over LZO when it comes to SELECTs, GROUP BYs or FILTERs?

Rakesh Singh · Accepted Answer

Yes. The compression encoding used translates to amount of disk storage. Generally, the lower the storage the better would be query performance.

But, which encoding would be be more beneficial to you depends on your data type and its distribution. There is no gurantee that LZO will always be better than Bytedict or vice-a-versa. In my experience, I usually load some sample data in the intended table. Than do a analyze compression. Now whatever Redshift suggests, I go with it. That has worked for me.

Do Redshift column encodings affect query execution speed?

Tags:

amazon-redshift

Mendhak

1 Answers

Rakesh Singh

Recent Activity

Donate For Us

Do Redshift column encodings affect query execution speed?

Tags:

amazon-redshift

Mendhak

1 Answers

Rakesh Singh

Related questions

Recent Activity

Donate For Us