I'm trying to migrate some MySQL tables to Amazon Redshift, but ran into some problems.
The steps are simple:
1. Dump the MySQL table to a CSV file (a sketch of one way to do this is below)
2. Upload the CSV file to S3
3. Copy the data file to Redshift
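For step 1, a minimal sketch of one way to produce the dump, assuming the table is exported with MySQL's SELECT ... INTO OUTFILE (the exact dump method isn't shown here, and the output path is hypothetical):

-- Dump TABLE_A to a CSV file on the MySQL server (hypothetical path)
SELECT * FROM TABLE_A
INTO OUTFILE '/tmp/TABLE_A.csv'
FIELDS TERMINATED BY ',' OPTIONALLY ENCLOSED BY '"'
LINES TERMINATED BY '\n';

Note that OPTIONALLY ENCLOSED BY '"' wraps values in double quotes, which matters for how Redshift parses the file later.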
The error occurs in step 3.
The SQL command is:
copy TABLE_A from 's3://ciphor/TABLE_A.csv' CREDENTIALS 'aws_access_key_id=xxxx;aws_secret_access_key=xxxx' delimiter ',' csv;
The error message:
An error occurred when executing the SQL command: copy TABLE_A from 's3://ciphor/TABLE_A.csv' CREDENTIALS 'aws_access_key_id=xxxx;aws_secret_access_key=xxxx ERROR: COPY CSV is not supported [SQL State=0A000] Execution time: 0.53s 1 statement(s) failed.
I don't know if there are any limitations on the format of the CSV file, such as the delimiters and quotes; I can't find this in the documentation.
Can anyone help?
You can use Get & Transform (Power Query) to connect to Amazon Redshift from Excel via ODBC. This method assumes that you've installed an ODBC driver for Amazon Redshift. Click the Data tab in Excel, then expand the Get Data drop-down list and click From Other Sources > From ODBC.
The COPY command is an extension of SQL supported by Redshift. Therefore, the COPY command needs to be issued from an SQL client. You mention that you have configured SQL Workbench. Once you connect to the Redshift cluster, run the command from within that connection.
The problem was finally resolved by using:
copy TABLE_A from 's3://ciphor/TABLE_A.csv' CREDENTIALS 'aws_access_key_id=xxxx;aws_secret_access_key=xxxx' delimiter ',' removequotes;
More information can be found here: http://docs.aws.amazon.com/redshift/latest/dg/r_COPY.html
Amazon Redshift now supports the CSV option for the COPY command, and it's better to use this option to import CSV-formatted data correctly. The format is shown below.
COPY [table-name] FROM 's3://[bucket-name]/[file-path or prefix]'
CREDENTIALS 'aws_access_key_id=xxxx;aws_secret_access_key=xxxx' CSV;
The default delimiter is a comma ( , ) and the default quote character is a double quote ( " ). You can also import TSV-formatted data with the CSV and DELIMITER options, like this:
COPY [table-name] FROM 's3://[bucket-name]/[file-path or prefix]'
CREDENTIALS 'aws_access_key_id=xxxx;aws_secret_access_key=xxxx' CSV DELIMITER '\t';
The old way (DELIMITER and REMOVEQUOTES) has a disadvantage: REMOVEQUOTES does not support a newline or a delimiter character within an enclosed field. If the data can include such characters, you should use the CSV option, as in the sketch below.
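For example, here is a hypothetical row (reusing the bucket and table names from the question) with quoted fields containing an embedded comma and an embedded newline; the CSV option loads it as a single record:

-- Suppose s3://ciphor/TABLE_A.csv contains a row like:
--   1,"Smith, John","123 Main St.
--   Apt 4"
-- With the CSV option the quoted fields survive intact:
copy TABLE_A from 's3://ciphor/TABLE_A.csv'
CREDENTIALS 'aws_access_key_id=xxxx;aws_secret_access_key=xxxx'
CSV;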
See the following link for details: http://docs.aws.amazon.com/redshift/latest/dg/r_COPY.html
If you want to save yourself some code, or you have a very basic use case, you can use AWS Data Pipeline. It starts a spot instance and performs the transformation within the Amazon network, and it's a really intuitive tool (but very simple, so you can't do complex things with it).
You can try this:
copy TABLE_A from 's3://ciphor/TABLE_A.csv' CREDENTIALS 'aws_access_key_id=xxxx;aws_secret_access_key=xxxx' csv;
CSV itself means comma-separated values, so there's no need to provide a delimiter with it. Please refer to this link:
http://docs.aws.amazon.com/redshift/latest/dg/copy-parameters-data-format.html#copy-format