I have a few Nifi process groups which I want to run integration tests on before promoting to production. The issue is that I can't seem to find any documentation on how to do so. Data Provenance seems like a promising tool to accomplish what I want, however, over the course of the flowfile's lifecycle, data is published to/from kafka or the file system. As a result, the flowfile UUID changes so I cannot query for it using the <code>nifi-api</code>. Additionally, I know that Nifi offers a <code>TestRunner</code> library to run tests, however, this seems to only be for processors/processor groups generated via code and not the UI. Does anyone know of a tool, framework, or pattern for integration and unit testing nifi process groups. Ideally this would be a solution where you can programatically compare input/output of the processor/processor group without modifying the existing workflow.

With the introduction of the Apache NiFi Registry, we have seen users promote flows from a development/sandbox environment to a test/QE environment where there are existing "test harness" flows surrounding the "flow under test" so that they can send repeatable and deterministic (or an anonymized sample of real production data) through the flow and compare the results to an expected value. As you point out, there is a <code>TestRunner</code> class and a whole testing framework provided for unit tests. While it can be difficult to manually translate a UI-constructed flow to the programmatic construction, you could also create something like a translator to accept a flow template or flow.xml.gz file and convert it into something processable by the test framework.

Integration and Unit testing Nifi process groups

Tags:

testing

apache-nifi

I have a few Nifi process groups which I want to run integration tests on before promoting to production. The issue is that I can't seem to find any documentation on how to do so.

Data Provenance seems like a promising tool to accomplish what I want, however, over the course of the flowfile's lifecycle, data is published to/from kafka or the file system. As a result, the flowfile UUID changes so I cannot query for it using the nifi-api.

Additionally, I know that Nifi offers a TestRunner library to run tests, however, this seems to only be for processors/processor groups generated via code and not the UI.

Does anyone know of a tool, framework, or pattern for integration and unit testing nifi process groups. Ideally this would be a solution where you can programatically compare input/output of the processor/processor group without modifying the existing workflow.

479

asked Aug 13 '18 22:08

bryce

1 Answers

With the introduction of the Apache NiFi Registry, we have seen users promote flows from a development/sandbox environment to a test/QE environment where there are existing "test harness" flows surrounding the "flow under test" so that they can send repeatable and deterministic (or an anonymized sample of real production data) through the flow and compare the results to an expected value.

As you point out, there is a TestRunner class and a whole testing framework provided for unit tests. While it can be difficult to manually translate a UI-constructed flow to the programmatic construction, you could also create something like a translator to accept a flow template or flow.xml.gz file and convert it into something processable by the test framework.

108

answered Oct 13 '22 20:10

Andy

Related questions
                            
                                Simulate missing package for testing?
                            
                                How to generate html report with gradle 1.12?
                            
                                How can this loop ever exit?
                            
                                Use multiple reporters in Mocha browser?
                            
                                How to tell pip to install test dependencies?
                            
                                Failed expectation: "Expected [ ] to be empty array."
                            
                                Protractor - Unable to run protractor tests
                            
                                Testing if object has multiple properties
                            
                                jasmine typeError is not a function
                            
                                Get SparkUncaughtExceptionHandler when run spark-perf
                            
                                How can I run integration tests after building all modules in a multi-module Maven project?
                            
                                Where do you store reusable mocks?
                            
                                Angular 2 testing: Async callback was not invoked within timeout specified by jasmine.DEFAULT_TIMEOUT_INTERVAL
                            
                                Conditional test to bypass pop up with Testcafe
                            
                                jest process.cwd() to get the test file directory
                            
                                testing non-exported methods in python
                            
                                Run tests on dynamic file in cypress.io
                            
                                Best practice to simulate exception for no space left on disk in Python with OpenCV
                            
                                Working with cypress redirect
                            
                                Using value of user defined variable in another user defined variable in JMeter

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With