What is use of hcatalog in hadoop?

2 Answers

In short, HCatalog opens up the hive metadata to other mapreduce tools. Every mapreduce tools has its own notion about HDFS data (example Pig sees the HDFS data as set of files, Hive sees it as tables). With having table based abstraction, HCatalog supported mapreduce tools do not need to care about where the data is stored, in which format and storage location (HBase or HDFS).

We do get the facility of WebHcat to submit jobs in an RESTful way if you configure webhcat along Hcatalog.

answered Oct 19 '22 18:10

Prabu Soundar Rajan

Here is a very basic example of how ho use HCATALOG.

I have a table in hive ,TABLE NAME is STUDENT which is stored in one of the HDFS location:

neethu 90 malini 90 sunitha 98 mrinal 56 ravi 90 joshua 8

Now suppose I want to load this table to pig for further transformation of data, In this scenario I can use HCATALOG:

When using table information from the Hive metastore with Pig, add the -useHCatalog option when invoking pig:

pig -useHCatalog

(you may want to export HCAT_HOME 'HCAT_HOME=/usr/lib/hive-hcatalog/')

Now loading this table to pig: A = LOAD 'student' USING org.apache.hcatalog.pig.HCatLoader();

Now you have loaded the table to pig.To check the schema , just do a DESCRIBE on the relation.

DESCRIBE A

Thanks

answered Oct 19 '22 20:10

Neethu Lalitha

Related questions
                            
                                Using gmail smtp via Laravel: Connection could not be established with host smtp.gmail.com [Connection timed out #110]
                            
                                Laravel 5 Auth logout not working
                            
                                Android Studio 2.2 Google play services sync Error
                            
                                "document is not defined" in Nuxt.js
                            
                                How to read in numbers as command arguments?
                            
                                How do i configure nginx to redirect to a url for robots.txt & sitemap.xml
                            
                                Java I/O streams; what are the differences?
                            
                                Getting pair-set using LINQ
                            
                                iPhone + UITableView + row height
                            
                                RVM is not working over SSH
                            
                                How to add objects to an NSArray using for loop?
                            
                                How to split a long string without breaking words?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What is use of hcatalog in hadoop?

Tags:

Vijay_Shinde

People also ask

2 Answers

Prabu Soundar Rajan

Neethu Lalitha

Recent Activity

Donate For Us