I am trying to run a crawler across an S3 data store in my account that contains two CSV files. However, when I run the crawler, no tables are created, and I see the following error in CloudWatch for each of the files:
This is especially odd because the IAM role has the AdministratorAccess policy attached, so there should not be any access-denied issues.
Any help would be appreciated.
No, you don't need to create a crawler to run a Glue job. A crawler can read multiple data sources and keep the Glue Data Catalog up to date.
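If you just want to run an existing job without a crawler, you can start it directly. A minimal sketch, assuming a job named my-glue-job already exists:

    aws glue start-job-run --job-name my-glue-job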
Error: Could not find S3 endpoint or NAT gateway for subnetId in VPC
Check the subnet ID and VPC ID in the message to help you diagnose the issue. Check that you have an Amazon S3 VPC endpoint set up, which is required with AWS Glue. In addition, check your NAT gateway if that's part of your configuration.
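If the endpoint is missing, you can create an S3 gateway endpoint in the VPC that your Glue connection uses. A sketch with the AWS CLI; the VPC ID, region, and route table ID below are placeholders for your own values:

    aws ec2 create-vpc-endpoint \
        --vpc-id vpc-0abc1234def567890 \
        --service-name com.amazonaws.us-east-1.s3 \
        --vpc-endpoint-type Gateway \
        --route-table-ids rtb-0abc1234def567890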
Check to see if the files you are crawling are encrypted. If they are, then your Glue role probably doesn't have a policy that allows it to decrypt.
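One way to check is to inspect an object's metadata; the bucket and key below are placeholders:

    aws s3api head-object --bucket my-bucket --key data/file1.csv

If the response includes "ServerSideEncryption": "aws:kms" and an SSEKMSKeyId, the role needs kms:Decrypt on that key.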
If so, it might need something like this:
{
    "Version": "2012-10-17",
    "Statement": {
        "Effect": "Allow",
        "Action": [
            "kms:Decrypt"
        ],
        "Resource": [
            "arn:aws:kms:us-west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab",
            "arn:aws:kms:us-west-2:111122223333:key/0987dcba-09fe-87dc-65ba-ab0987654321"
        ]
    }
}
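Assuming the policy is saved locally as kms-decrypt.json, you could attach it as an inline policy on the crawler's role like this (the role and policy names here are placeholders):

    aws iam put-role-policy \
        --role-name MyGlueCrawlerRole \
        --policy-name AllowKmsDecrypt \
        --policy-document file://kms-decrypt.json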