Redshift COPY command delimiter not found

Tags:

I'm trying to load some text files to Redshift. They are tab delimited, except for after the final row value. That's causing a delimiter not found error. I only see a way to set the field delimiter in the COPY statement, not a way to set a row delimiter. Any ideas that don't involve processing all my files to add a tab to the end of each row?

Thanks

492

asked Feb 18 '14 18:02

Erik Darling

3 Answers

I don't think the problem is with missing <tab> at the end of lines. Are you sure that ALL lines have correct number of fields?

Run the query:

select le.starttime, d.query, d.line_number, d.colname, d.value,
le.raw_line, le.err_reason    
from stl_loaderror_detail d, stl_load_errors le
where d.query = le.query
order by le.starttime desc
limit 100

to get the full error report. It will show the filename with errors, incorrect line number, and error details.

This will help to find where the problem lies.

103

answered Sep 20 '22 15:09

Tomasz Tybulewicz

You can get the delimiter not found error if your row has less columns than expected. Some CSV generators may just output a single quote at the end if last columns are null.

To solve this you can use FILLRECORD on Redshift copy options.

answered Sep 19 '22 15:09

Madhava Carrillo

From my understanding the error message Delimiter not found may be caused also by not specifying correctly the COPY command, in particular by not specifying the Data format parameters https://docs.aws.amazon.com/redshift/latest/dg/r_COPY.html

In my case I was trying to load Parquet data with this expression:

COPY my_schema.my_table
FROM 's3://my_bucket/my/folder/'
IAM_ROLE 'arn:aws:iam::my_role:role/my_redshift_role'
REGION 'my-region-1';

and I received the Delimiter not found error message when looking into the system table stl_load_errors. But specifying I'm dealing with Parquet data in the expression in this way:

COPY my_schema.my_table
FROM 's3://my_bucket/my/folder/'
IAM_ROLE 'arn:aws:iam::my_role:role/my_redshift_role'
FORMAT AS PARQUET;

solved my problem and I was able to correctly load the data.

answered Sep 21 '22 15:09

Vzzarr

Related questions
                            
                                Spring boot startup error for AWS application : There is not EC2 meta data available
                            
                                Using aws cli, what is best way to determine the current region
                            
                                Unable to create a stage in AWS API Gateway
                            
                                Reading data from S3 using Lambda
                            
                                aws lambda function triggering multiple times for a single event
                            
                                Node.js maxing out at 1000 concurrent connections
                            
                                What's the target group port for, when using Application Load Balancer + EC2 Container Service
                            
                                Adding AWS Lambda with VPC configuration causes timeout when accessing S3
                            
                                AWS CLI S3: copying file locally using the terminal : fatal error: An error occurred (404) when calling the HeadObject operation
                            
                                How I can work with Amazon's Dynamodb Local in Node?
                            
                                PuTTYgen doesn't give me the option for SSH-2 RSA
                            
                                What does 'cpu' parameter mean in aws container service?
                            
                                Using Amazon SQS with multiple consumers
                            
                                Getting S3 objects' last modified datetimes with boto
                            
                                Access denied to SQS via AWS SDK
                            
                                How to convert a boto3 Dynamo DB item to a regular dictionary in Python?
                            
                                AWS cloud formation Template- providing Tags for the stack in the template
                            
                                how to copy s3 object from one bucket to another using python boto3
                            
                                AWS API Gateway No 'Access-Control-Allow-Origin' header is present
                            
                                installing mod_ssl amazon linux

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Redshift COPY command delimiter not found

Tags:

amazon-web-services

amazon-redshift