I've loaded tab-separated files into S3 with this folder structure under the bucket: bucket --> se --> y=2013 --> m=07 --> d=14 --> h=00
Each leaf folder contains one file representing one hour of my traffic.
I then created an EMR job flow running Hive in interactive mode.
When I log in to the master node and open the Hive shell, I run this command:
CREATE EXTERNAL TABLE se (
id bigint,
oc_date timestamp)
partitioned by (y string, m string, d string, h string)
ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t'
LOCATION 's3://bi_data';
I get this error message:
FAILED: Error in metadata: java.lang.IllegalArgumentException: The bucket name parameter must be specified when listing objects in a bucket
FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.DDLTask
Can anybody help?
UPDATE: Even if I use string columns only, I get the same error. Create table with strings:
CREATE EXTERNAL TABLE se (
id string,
oc_date string)
partitioned by (y string, m string, d string, h string)
ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t'
LOCATION 's3://bi_data';
Hive version 0.8.1.8
To create a Hive table on top of those files, you have to specify the structure of the files by giving column names and types:
CREATE EXTERNAL TABLE posts (title STRING, comment_count INT)
LOCATION 's3://my-bucket/files/';
Here is a list of all allowed types.
So, the solution: I had made two mistakes:
When specifying only a bucket name, the S3 path must end with a trailing slash. reference here
The underscore is also a problem: the bucket name must be DNS-compliant (lowercase letters, digits, and hyphens only), so bi_data is invalid.
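Putting both fixes together, the DDL would look something like the sketch below. The bucket name bi-data is illustrative (assuming the bucket was re-created with a DNS-compliant name), and because the table is partitioned, each hourly partition also has to be registered before queries will see any data:

```sql
-- Assumes a re-created, DNS-compliant bucket named "bi-data" (illustrative name).
-- Note the trailing slash on the LOCATION path.
CREATE EXTERNAL TABLE se (
  id BIGINT,
  oc_date TIMESTAMP)
PARTITIONED BY (y STRING, m STRING, d STRING, h STRING)
ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t'
LOCATION 's3://bi-data/se/';

-- Partitions are not discovered automatically; add each hour explicitly, e.g.:
ALTER TABLE se ADD PARTITION (y='2013', m='07', d='14', h='00')
  LOCATION 's3://bi-data/se/y=2013/m=07/d=14/h=00/';
```

Since the folder names already follow the y=/m=/d=/h= convention, later Hive versions can also discover them in bulk with MSCK REPAIR TABLE se; (or ALTER TABLE se RECOVER PARTITIONS; on EMR).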
Hope this helps someone.