Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

how to transfer data between Azure tables and Hadoop on Azure

I would like to import data from Azure table and then export to Hadoop cluster on Azure. Currently I am downloading the data from Azure table into .csv and then uploading the .csv to Hadoop cluster. I want to know whether there is any way to do it directly.

Thanks, Kiran

like image 464
user2136336 Avatar asked Dec 13 '25 16:12

user2136336


1 Answers

I have written a HiveStorageHandler which will let you create Hive Table which are backed by Azure Storage Tables. It's available on github:

https://github.com/simonellistonball/hive-azuretables

Note that you will also need the Microsoft Azure SDK for Java in your hadoop classpath for this to work. The easiest way to do that is probably to install it somewhere central like /usr/lib/hadoop/lib/.

If you do a mvn package on my repo, you should get something you can then load with a hive ADD JAR. The Readme for the repo contains a sample query showing how to create a Hive table that connects to Azure.

like image 136
Simon Elliston Ball Avatar answered Dec 16 '25 12:12

Simon Elliston Ball



Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!