Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

CSV to AVRO conversion in Azure

I am trying to convert csv files stored in azure data lake store into avro files with created scheme. Is there any kind of example source code which has same purpose?

like image 344
emkay Avatar asked Jan 04 '23 21:01

emkay


1 Answers

You can use Azure Data Lake Analytics for this. There is a sample Avro extractor at https://github.com/Azure/usql/blob/master/Examples/DataFormats/Microsoft.Analytics.Samples.Formats/Avro/AvroExtractor.cs. You can easily adapt the code into an outputter.

Another possibility is to fire up an HDInsight cluster on top of your data lake store and use Pig, Hive or Spark.

like image 57
Alexandre Gattiker Avatar answered Jan 16 '23 07:01

Alexandre Gattiker