I am trying to copy some data from an S3 bucket to a Redshift table using the COPY command. The format of the file is Parquet. When I execute the COPY command, I get InternalError_: Spectrum Scan Error.
This is the first time I have tried copying from a Parquet file.
Please help me if there is a solution for this. I am using boto3 in Python.
Spectrum scan errors are usually caused by a column mismatch between source and destination, e.g. if you are copying data from S3 to Redshift and the columns of the Parquet file are not in the same order as those in the Redshift table.
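To make the column-order point concrete, here is a small sketch (table, bucket, and role names are placeholders, not from the question). With FORMAT AS PARQUET, Redshift matches the file's columns to the table's columns positionally, so the table DDL must list columns in the same order as the Parquet file:

```python
def build_parquet_copy_sql(table: str, s3_path: str, iam_role: str) -> str:
    """Build a COPY statement for loading a Parquet file into Redshift.

    Note: with FORMAT AS PARQUET, columns are matched by position, so a
    table whose column order differs from the file's is a common cause
    of 'Spectrum Scan Error'.
    """
    return (
        f"COPY {table} "
        f"FROM '{s3_path}' "
        f"IAM_ROLE '{iam_role}' "
        f"FORMAT AS PARQUET;"
    )


# Hypothetical names for illustration only.
sql = build_parquet_copy_sql(
    "my_schema.my_table",
    "s3://my-bucket/data/part-0000.parquet",
    "arn:aws:iam::123456789012:role/MyRedshiftRole",
)
```

Before running this, compare the table definition (column order and types) against the Parquet schema, e.g. with `parquet-tools` or `pyarrow.parquet.read_schema`.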
Attach the policy to the IAM role; at this stage, Redshift will be able to read data in the target bucket. Then create a Redshift cluster, create a Redshift table whose structure matches that of the original CSV file, and copy the data from the CSV files stored in the S3 bucket into the Redshift table.
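Since the question mentions boto3, the final "copy data" step can be driven from Python with the Redshift Data API. A minimal sketch, assuming boto3 is installed, AWS credentials are configured, and the cluster/database/user names (which are placeholders here) exist:

```python
def run_copy(sql: str, cluster: str, database: str, db_user: str) -> str:
    """Submit a COPY statement to Redshift via the Data API.

    Returns the statement id; the call is asynchronous, so poll
    describe_statement() with that id to see whether the COPY succeeded.
    """
    import boto3  # imported inside the function so the sketch reads standalone

    client = boto3.client("redshift-data")
    resp = client.execute_statement(
        ClusterIdentifier=cluster,   # placeholder cluster name
        Database=database,           # placeholder database name
        DbUser=db_user,              # placeholder database user
        Sql=sql,
    )
    return resp["Id"]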
This generally happens for a few reasons, so try going into the error logs. You might find a partial log in CloudWatch. From the screenshot you have uploaded, you can also find the number of the query you ran.
Go to the AWS Redshift query editor and run the query below to get the full log:
select message
from svl_s3log
where query = '<<your query number>>'
order by query,segment,slice;
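The same SVL_S3LOG lookup can be run from Python via the Redshift Data API, which fits the asker's boto3 setup. A sketch with placeholder cluster/database/user names (assumes boto3 and configured credentials):

```python
def fetch_copy_error_log(query_id: int, cluster: str, database: str, db_user: str) -> str:
    """Submit the SVL_S3LOG diagnostic query for a given query number.

    Returns the statement id; poll describe_statement() and then call
    get_statement_result() with the same id to read the error messages.
    """
    import boto3  # imported inside the function so the sketch reads standalone

    sql = (
        "select message from svl_s3log "
        f"where query = {int(query_id)} "  # int() guards against injection
        "order by query, segment, slice;"
    )
    client = boto3.client("redshift-data")
    resp = client.execute_statement(
        ClusterIdentifier=cluster,
        Database=database,
        DbUser=db_user,
        Sql=sql,
    )
    return resp["Id"]
```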
Hope this helps!
This error usually indicates a problem with the compatibility between the data in your file and the Redshift table. You can get more insight into the error from the SVL_S3LOG table. In my case it was because the file had some invalid UTF-8 characters.
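To check for the invalid-UTF-8 case before loading, you can scan the raw bytes locally. A minimal sketch (`find_invalid_utf8` is a helper written for this answer, not a library function):

```python
def find_invalid_utf8(raw: bytes) -> list[tuple[int, int]]:
    """Return (offset, byte_value) pairs where the data is not valid UTF-8.

    An empty list means the bytes decode cleanly. For text-format COPY
    loads, Redshift's ACCEPTINVCHARS option can replace such bytes instead;
    for Parquet, clean the data before writing the file.
    """
    bad = []
    i = 0
    while i < len(raw):
        try:
            # Decode the remaining bytes; on failure, e.start gives the
            # offset of the first invalid byte within the slice.
            raw[i:].decode("utf-8")
            break
        except UnicodeDecodeError as e:
            bad.append((i + e.start, raw[i + e.start]))
            i += e.start + 1  # skip past the bad byte and keep scanning
    return bad
```

For a quick cleanup rather than a report, `raw.decode("utf-8", errors="replace")` substitutes each invalid byte with U+FFFD.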