AWS Athena: use "folder" name as partition

2 Answers

It is possible to do this now using storage.location.template. This will partition by some part of your path. Be sure to NOT include the new column in the column list, as it will automatically be added. There are a lot of options you can search to tweak this for your date example. I used "id" to show the simplest version i could think of.

CREATE EXTERNAL TABLE `some_table`(
  `col1` bigint, 
PARTITIONED BY (
  `id` string
  )
ROW FORMAT SERDE 
  'org.openx.data.jsonserde.JsonSerDe' 
STORED AS INPUTFORMAT 
  'org.apache.hadoop.mapred.TextInputFormat' 
OUTPUTFORMAT 
  'org.apache.hadoop.hive.ql.io.IgnoreKeyTextOutputFormat'
LOCATION
  's3://path/bucket/'
TBLPROPERTIES (
  'has_encrypted_data'='false',
  'projection.enabled'='true', 
  'projection.id.type' = 'injected',
  'storage.location.template'='s3://path/bucket/${id}/'
  )

official docs: https://docs.amazonaws.cn/en_us/athena/latest/ug/partition-projection-dynamic-id-partitioning.html

129

answered Sep 17 '22 16:09

Jeremy Giaco

Its not necessary to do this manually. Setup a glue crawler and it will pick-up the folder( in the prefix) as a partition, if all the folders in the path has the same structure and all the data has the same schema design.

Put it will name the partition as partition0. You can go into edit-schema and change the name of this partition to date or whatever you like.

But make sure you go into your glue crawler and under "configuration options" select the option - "Add new columns only". Otherwise on the next glue-crawler run it will reset the partition name back to partition0.

answered Sep 21 '22 16:09

Venkat.V.S

Related questions
                            
                                Getting Error "cannot use immediate apply method for static parameter"
                            
                                Which role is attached to instance
                            
                                Why do we need a Private Subnet + NAT translation in AWS? Can't we just use a Public Subnet + a properly configured security group?
                            
                                Geospatial query in DynamoDB
                            
                                Connect to MySQL database from Lambda function (Node)
                            
                                AWS Socket Not created by this factory
                            
                                How to implement AWS IoT(device) in React-Native?
                            
                                AWS DynamoDB BatchWriteItem - Write Capacity Units
                            
                                How to integrate AWS Secret Manager with Spring Boot Application
                            
                                "storage/logs/laravel-2019-11-22.log" could not be opened: failed to open stream: Permission denied
                            
                                AWS Lambda IP address ranges
                            
                                Access files in s3n://elasticmapreduce/samples/wordcount/input
                            
                                SSL problems with S3/AWS using the Java API: "hostname in certificate didn't match"
                            
                                AWS DynamoDB Query Call (with no results) Cost
                            
                                How can I sync two amazon buckets using the AWS CLI?
                            
                                zsh: parse error near `\n' when Adding AWS keys as environment variables
                            
                                How to pre-warm CloudFront edge servers' cache?
                            
                                Are AWS S3 Event Notifications guaranteed to be delivered?
                            
                                How can I serve static web site from s3 through node expressjs?
                            
                                New IAM admin user sees "You are not authorized to perform this operation"

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

AWS Athena: use "folder" name as partition

Tags:

amazon-web-services

amazon-s3

amazon-athena

Raphael

People also ask

2 Answers

Jeremy Giaco

Venkat.V.S

Recent Activity

Donate For Us