Amazon Athena - Column cannot be resolved on basic SQL WHERE query

Tags: amazon-web-services, amazon-s3, amazon-athena

I am currently evaluating Amazon Athena and Amazon S3. I have created a database (testdb) with one table (awsevaluationtable). The table has two columns, x (bigint) and y (bigint).

When I run:

SELECT * 
FROM testdb."awsevaluationtable"

I get all of the test data: [screenshot: Successful Query]

However, when I try a basic WHERE query:

SELECT * 
FROM testdb."awsevaluationtable" 
WHERE x > 5

I get:

SYNTAX_ERROR: line 3:7: Column 'x' cannot be resolved

I have tried all sorts of variations:

SELECT * FROM testdb.awsevaluationtable WHERE x > 5
SELECT * FROM awsevaluationtable WHERE x > 5
SELECT * FROM testdb."awsevaluationtable" WHERE X > 5
SELECT * FROM testdb."awsevaluationtable" WHERE testdb."awsevaluationtable".x > 5
SELECT * FROM testdb.awsevaluationtable WHERE awsevaluationtable.x > 5

I have also confirmed that the x column exists with:

SHOW COLUMNS IN sctawsevaluation

[screenshot: Column query]

This seems like an extremely simple query yet I can't figure out what is wrong. I don't see anything obvious in the documentation. Any suggestions would be appreciated.


asked Aug 22 '18 19:08

Joel

2 Answers

In my case, changing double quotes to single quotes resolves this error.

Presto, the query engine Athena is built on, uses single quotes for string literals and double quotes for identifiers.

From the Trino documentation (https://trino.io/docs/current/migration/from-hive.html#use-ansi-sql-syntax-for-identifiers-and-strings):

Strings are delimited with single quotes and identifiers are quoted with double quotes, not backquotes:
SELECT name AS "User Name"
FROM "7day_active"
WHERE name = 'foo'


answered Sep 17 '22 14:09

nekketsuuu

I have edited my response to this issue based on my current findings and my contact with both the AWS Glue and Athena support teams.

We were having the same issue: we could not query the first column in our CSV files. The problem comes down to the encoding of the CSV file. In short, AWS Glue and Athena currently do not support CSVs encoded in UTF-8-BOM. If you open a CSV encoded with a Byte Order Mark (BOM) in Excel or Notepad++, it looks like any comma-delimited text file. However, opening it in a hex editor reveals the underlying issue. The file starts with three extra bytes (EF BB BF), which show up as the characters ï»¿, i.e. the BOM.
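
If you don't have a hex editor handy, the same check can be done in a few lines of Python. This is only a minimal sketch: the sample bytes are made up for illustration, and has_utf8_bom / file_has_utf8_bom are hypothetical helper names, not part of any AWS tooling.

```python
import codecs


def has_utf8_bom(data: bytes) -> bool:
    """Return True if the byte string starts with the UTF-8 BOM (EF BB BF)."""
    return data.startswith(codecs.BOM_UTF8)


def file_has_utf8_bom(path: str) -> bool:
    """Check only the first three bytes of a file on disk."""
    with open(path, "rb") as handle:
        return has_utf8_bom(handle.read(3))


# Hypothetical sample content, built in memory so the sketch is self-contained.
with_bom = codecs.BOM_UTF8 + b"x,y\n1,2\n"
without_bom = b"x,y\n1,2\n"

print(has_utf8_bom(with_bom))
print(has_utf8_bom(without_bom))
# Decoding the BOM bytes as Latin-1 gives the three characters a hex editor shows.
print(with_bom[:3].decode("latin-1"))
```

For a real file you would call file_has_utf8_bom with the local path of the CSV before uploading it to S3.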

When a UTF-8-BOM CSV file is processed in AWS Glue, it retains these special characters and associates them with the first column name. When you then try to query the first column in Athena, you get an error, because the stored column name no longer matches the name you type.
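
To illustrate the mechanism (not Glue's actual crawler, which I can't show here), this sketch uses Python's own csv module on made-up sample data. Decoding with plain "utf-8" keeps the BOM and glues it onto the first header, while "utf-8-sig" drops it. The helper name header_names is hypothetical.

```python
import codecs
import csv
import io

# Hypothetical CSV content with a leading UTF-8 BOM, as some Windows tools write it.
raw = codecs.BOM_UTF8 + b"x,y\n1,2\n6,7\n"


def header_names(data: bytes, encoding: str) -> list:
    """Decode the bytes with the given encoding and return the CSV header row."""
    text = io.StringIO(data.decode(encoding), newline="")
    return next(csv.reader(text))


# "utf-8" keeps the BOM as the invisible character U+FEFF on the first column name.
naive = header_names(raw, "utf-8")
# "utf-8-sig" strips a leading BOM if one is present.
clean = header_names(raw, "utf-8-sig")

print([ascii(name) for name in naive])
print(clean)
print(naive[0] == "x")
```

The first header looks like x on screen but is really U+FEFF followed by x, which matches the symptom above: the column shows up in SHOW COLUMNS but can't be resolved by the name x.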

There are ways around this on AWS:

In AWS Glue, edit the table schema, delete the first column, then add it back with the proper column name, OR
In AWS Athena, run SHOW CREATE TABLE to script out the problematic table, remove the special characters from the generated script, then run the script to create a new table that you can query.

To make your life simple, just make sure your CSVs are encoded as UTF-8 without a BOM.
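
If the files are already written with a BOM, one way to fix them before upload is to strip the three leading bytes. A minimal sketch, assuming the file fits in memory; strip_utf8_bom and rewrite_without_bom are hypothetical helper names and the paths are whatever you pass in.

```python
import codecs


def strip_utf8_bom(data: bytes) -> bytes:
    """Return the data without a leading UTF-8 BOM; other content is unchanged."""
    if data.startswith(codecs.BOM_UTF8):
        return data[len(codecs.BOM_UTF8):]
    return data


def rewrite_without_bom(src_path: str, dst_path: str) -> None:
    """Copy src_path to dst_path, dropping a leading BOM if there is one.

    Reads the whole file into memory, which is fine for small test files;
    a streaming copy would be a better fit for very large CSVs.
    """
    with open(src_path, "rb") as src:
        data = src.read()
    with open(dst_path, "wb") as dst:
        dst.write(strip_utf8_bom(data))


# Quick in-memory check on hypothetical sample bytes.
print(strip_utf8_bom(codecs.BOM_UTF8 + b"x,y\n1,2\n"))
print(strip_utf8_bom(b"x,y\n1,2\n"))
```

Files without a BOM pass through untouched, so it should be safe to run this over a mixed batch.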

answered Sep 20 '22 14:09

owl7
