Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Apache Nifi - Extract Attributes From Avro

I'm trying to get my head around on extracting attributes from Avro and JSON. I'm able to extract attributes from JSON by using EvaluateJsonPath processor. I'm trying to do the same on Avro, but i'm not sure whether it is achievable.

Here is my flow, ExecuteSQL -> SplitAvro -> UpdateAttribute

UpdateAttribute is the processor where i want to extract the attributes. Please find below snapshot of UpdateAttribute processor,

UpdateAttribute Processor COnfiguration

So, my basic question is, could we extract attributes form Avro? If yes, please provide me the right approach. Or is it necessary to use ConvertAvroToJSON always before extracting the attributes?

like image 272
Pons Avatar asked Feb 27 '17 22:02

Pons


1 Answers

Currently, there is no way in NiFi to extract attributes directly from Avro (there is not yet an AvroPath like XPath for XML or JsonPath for JSON) so as you said you can use ConvertAvroToJSON before extracting the attributes.

Alternatively, I wrote a Groovy script for use in an ExecuteScript processor, it takes "Avro path" values as dynamic properties (each starting with avro.path and whose value is really JsonPath), does the conversion of Avro to JSON in memory, and requires you download and point to the Avro JARs. I can post it here if you are interested, but really its only advantage is to maintain the flow file content in Avro, and although it might be annoying, you could use ConvertAvroToJson -> EvaluateJsonPath -> ConvertJsonToAvro as the workaround.

like image 112
mattyb Avatar answered Oct 17 '22 15:10

mattyb