The source data keeps producing values for this field that grow longer and longer. Right now I'm using VARCHAR(200), but I might go to VARCHAR(400). Are there any disadvantages to using a large size?
You can create an Amazon Redshift table with a TEXT column, but it is converted to a VARCHAR(256) column that accepts variable-length values with a maximum of 256 characters.
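You can see that conversion for yourself; a minimal sketch, with a hypothetical table and column name:

    CREATE TABLE notes (body TEXT);

    -- Redshift stores the column as VARCHAR(256); confirm with:
    SELECT "column", type
    FROM pg_table_def
    WHERE tablename = 'notes';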
Amazon Redshift doesn't support tables with column-level privileges for cross-database queries. Amazon Redshift doesn't support concurrency scaling for queries that read data from other databases. Amazon Redshift doesn't support querying catalog objects on AWS Glue or federated databases.
Don’t make it a practice to use the maximum column size for convenience.
Instead, consider the largest values you are likely to store in a VARCHAR column, for example, and size your columns accordingly. Because Amazon Redshift compresses column data very effectively, creating columns much larger than necessary has minimal impact on the size of data tables. During processing for complex queries, however, intermediate query results might need to be stored in temporary tables. Because temporary tables are not compressed, unnecessarily large columns consume excessive memory and temporary disk space, which can affect query performance.
http://docs.aws.amazon.com/redshift/latest/dg/c_best-practices-smallest-column-size.html
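As a rough illustration of that advice (table, column names, and sizes here are hypothetical, chosen to fit the data rather than an arbitrary maximum):

    CREATE TABLE events (
        event_id   BIGINT,
        country    CHAR(2),        -- fixed-length two-letter code
        event_name VARCHAR(64),    -- longest observed value is well under 64 bytes
        payload    VARCHAR(400)    -- headroom over the current ~200-byte maximum
    );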
What do you mean, "downside"? There is a really big downside if you don't make the column big enough -- you can't store the values you need to store.
As for additional overhead, you don't need to worry about that. A varchar() type only takes up the storage needed for the value, plus a small overhead to record the length. Also, 400 is not such a big number, especially compared to 200.
So, if you need 400 bytes to store the value, alter the column to allow it. There may be a one-time cost to changing the length; I'm not sure whether Redshift needs to copy the data when the type changes. Either way, the ongoing effect on query performance should be negligible.
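For what it's worth, Redshift can widen a VARCHAR column in place with ALTER TABLE; a minimal sketch with hypothetical names (note that columns with certain compression encodings can't be altered this way):

    ALTER TABLE my_table
        ALTER COLUMN my_column TYPE VARCHAR(400);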