From the AWS::Athena::NamedQuery documentation, it is unclear how to attach Athena to an S3 bucket specified in the same stack.
If I had to guess from the example, I would imagine you could write a template like this:
Resources:
  MyS3Bucket:
    Type: AWS::S3::Bucket
    # ... other params ...
  AthenaNamedQuery:
    Type: AWS::Athena::NamedQuery
    Properties:
      Database: "db_name"
      Name: "MostExpensiveWorkflow"
      QueryString: >
        CREATE EXTERNAL TABLE db_name.test_table
        (...) LOCATION 's3://.../path/to/folder/'
Would a template like the above work? Upon stack creation, would the table db_name.test_table be available to run queries on?
Athena works directly with data stored in S3. It uses Presto, a distributed SQL engine, to run queries, and it uses Apache Hive to create, drop, and alter tables and partitions.
Amazon Athena is defined as “an interactive query service that makes it easy to analyze data directly in Amazon Simple Storage Service (Amazon S3) using standard SQL.” In other words, it is another SQL query engine for large datasets stored in S3.
So what's the difference between S3 Select and Athena? S3 Select is a lightweight feature for running simple SELECT clauses against a single object at a time, while Amazon Athena is an analytics workhorse that runs SQL over extremely large datasets spanning many files, with strong performance.
Turns out the way you connect S3 and Athena is to make a Glue table! How silly of me!! Of course Glue is how you connect things! (The template in the question would not work anyway: AWS::Athena::NamedQuery only saves a query in the Athena console; CloudFormation never executes it, so the CREATE EXTERNAL TABLE statement would never run and the table would never exist.)
Sarcasm aside, here is a template that worked for me, using AWS::Glue::Table and AWS::Glue::Database:
Resources:
  MyS3Bucket:
    Type: AWS::S3::Bucket
  MyGlueDatabase:
    Type: AWS::Glue::Database
    Properties:
      DatabaseInput:
        Name: my-glue-database
        Description: "Glue beats tape"
      CatalogId: !Ref AWS::AccountId
  MyGlueTable:
    Type: AWS::Glue::Table
    Properties:
      DatabaseName: !Ref MyGlueDatabase
      CatalogId: !Ref AWS::AccountId
      TableInput:
        Name: my-glue-table
        Parameters: { "classification": "csv" }
        StorageDescriptor:
          Location:
            Fn::Sub: "s3://${MyS3Bucket}/"
          InputFormat: "org.apache.hadoop.mapred.TextInputFormat"
          OutputFormat: "org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat"
          SerdeInfo:
            Parameters: { "separatorChar": "," }
            SerializationLibrary: "org.apache.hadoop.hive.serde2.OpenCSVSerde"
          StoredAsSubDirectories: false
          Columns:
            - Name: column0
              Type: string
            - Name: column1
              Type: string
After this, the database and table were in the AWS Athena Console!
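To close the loop on the original question, you can then point an AWS::Athena::NamedQuery at the Glue database. A minimal sketch, assuming the resources above (the resource name MyNamedQuery and the query text are my own additions, appended under the same Resources: section, not part of the template I tested):

MyNamedQuery:
  Type: AWS::Athena::NamedQuery
  Properties:
    # Ref on an AWS::Glue::Database resource returns the database name
    Database: !Ref MyGlueDatabase
    Name: "SelectAllRows"  # hypothetical name
    Description: "Example query against the Glue-backed table"
    QueryString: !Sub |
      -- table name is double-quoted because it contains hyphens
      SELECT column0, column1
      FROM "${MyGlueDatabase}"."my-glue-table"
      LIMIT 10;

Note that this only saves the query in the Athena console; you (or something like the StartQueryExecution API) still have to run it.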