How to safely unload/copy a table in RedShift?

Tags:

amazon-redshift

In RedShift, it is convenient to use unload/copy to move data to S3 and load back to redshift, but I feel it is hard to choose the delimiter each time. The right delimiter is relevant to the content of the table! I had to change the delimiter each time I met load errors.

For example, when I use the following command to unload/copy a table:

unload ('select * from tbl_example') to 's3://s3bucket/tbl_example' CREDENTIALS 'aws_access_key_id=xxx;aws_secret_access_key=xxx' delimiter '|' addquotes allowoverwrite;

copy tbl_example2 from 's3://s3bucket/tbl_example' CREDENTIALS 'aws_access_key_id=xxx;aws_secret_access_key=xxx' delimiter '|' removequotes;

I will get load error if the table happens to have a field with its content as "||". Then I have to change the delimiter '|' to another one like ',' and try again, if I'm unlucky, maybe it takes multiple tries to get a success.

I'm wondering if there's a way to unload/copy a redshift table which is irrelevant to the content of the table, which will always succeed no mater what weird strings are stored in the table.

659

asked Oct 16 '14 17:10

ciphor

2 Answers

Finally I figured out the right approach, to add escape in both unload and copy command:

unload ('select * from tbl_example') to 's3://s3bucket/tbl_example' CREDENTIALS 'aws_access_key_id=xxx;aws_secret_access_key=xxx' delimiter '|' addquotes escape allowoverwrite;

copy tbl_example2 from 's3://s3bucket/tbl_example' CREDENTIALS 'aws_access_key_id=xxx;aws_secret_access_key=xxx' delimiter '|' removequotes escape;

With escape in unload command, for CHAR and VARCHAR columns in delimited unload files, an escape character (\) is placed before every occurrence of the following characters:

Linefeed: \n
Carriage return: \r
The delimiter character specified for the unloaded data.
The escape character: \
A quote character: " or ' (if both ESCAPE and ADDQUOTES are specified in the UNLOAD command).

And with escape in copy command, the backslash character () in input data is treated as an escape character. The character that immediately follows the backslash character is loaded into the table as part of the current column value, even if it is a character that normally serves a special purpose. For example, you can use this option to escape the delimiter character, a quote, an embedded newline, or the escape character itself when any of these characters is a legitimate part of a column value.

answered Oct 10 '22 23:10

ciphor

Try unload like below

 unload ('select * from tbl_example') to 's3://s3bucket/tbl_example' CREDENTIALS 'aws_access_key_id=xxx;aws_secret_access_key=xxx' delimiter as ',' addquotes escape

To load it back use as below

copy tbl_example2 from 's3://s3bucket/tbl_example' CREDENTIALS 'aws_access_key_id=xxx;aws_secret_access_key=xxx' delimiter ',' removequotes escape;

This will work irrespective of your data might have , in between.

answered Oct 10 '22 21:10

Sandesh Deshmane

Related questions
                            
                                Time Difference in Redshift
                            
                                redshift: count distinct customers over window partition
                            
                                Redshift unload's file name
                            
                                temporarily shut down redshift to reduce bill
                            
                                List of external schemas and tables from Amazon Redshift
                            
                                How to create a dependency list for an object in Redshift?
                            
                                Mongodb to redshift
                            
                                How to select multiple rows filled with constants in Amazon Redshift?
                            
                                Redshift table update with join
                            
                                Gradle/Maven dependency for Redshift JDBC driver
                            
                                First and last day of last month in Redshift
                            
                                How can I limit DBeaver Data Editor to limit result set size?
                            
                                How to assign table ownership to multiple users or group in Redshift
                            
                                Convert text to timestamp in redshift
                            
                                How to find out what are the privileges granted to a specific group in redshift
                            
                                Copying a table from one redshift cluster to another redshift cluster(without using s3)
                            
                                Loading JSON data to AWS Redshift results in NULL values
                            
                                glue job for redshift connection: "Unable to find suitable security group"
                            
                                Is there any way to find table creation date in redshift?
                            
                                Redshift query between date

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to safely unload/copy a table in RedShift?

Tags:

amazon-redshift

ciphor

People also ask

2 Answers

ciphor

Sandesh Deshmane

Recent Activity

Donate For Us