Can I load data from splitted gzip files into an amazon redshift table?
I can load data from single gzip file or from splitted text files.
But can I load data from splitted gzip files?
I'm assuming here that you mean that you have multiple CSV files that are each gzipped.
First, upload each file to an S3 bucket under the same prefix and delimiter.
s3://S3_BUCKET/S3_PREFIX/file0.gz
s3://S3_BUCKET/S3_PREFIX/file1.gz
Then, execute the Redshift copy command:
copy TABLE_NAME from 's3://S3_BUCKET/S3_PREFIX' credentials 'aws_access_key_id=ACCESS_KEY;aws_secret_access_key=SECRET_KEY' csv gzip
Also, it's a good idea to have your file count proportional to the number of nodes in your cluster.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With