I use fork/join in Oozie, in order to parallel some sub-workflow actions. My workflow.xml looks like this: <pre class="prettyprint"><code><workflow-app name="myName" xmlns="uri:oozie:workflow:0.5" <start to="fork1"/> <kill name="Kill"> <message>Action failed, error message[${wf:errorMessage(wf:lastErrorNode())}]</message> </kill> <fork name="fork1"> <path start="subworkflow1"/> <path start="subworkflow2"/> </fork> <join name="Completed" to="End" <action name="subworkflow1"> <sub-workflow> <app-path>....</app-path> <propagate-configuration/> <configuration> <property> <name>....</name> <value>....</value> </property> </configuration> </sub-workflow> <ok to="Completed"/> <error to="Completed"/> </action> <action name="subworkflow2"> <sub-workflow> <app-path>....</app-path> <propagate-configuration/> <configuration> <property> <name>....</name> <value>....</value> </property> </configuration> </sub-workflow> <ok to="Completed"/> <error to="Completed"/> </action> <end name="End"></workflow-app> </code></pre> When subworkflow1 is killed (failed for some reason), It kills subworkflow2 also. I want those two actions to be parallel, but not dependent. In my workflow, when workflow1 is killed, I see that workflow2 is also killed, but my app succeeded (I check it on Oozie dashboard -> workflows in HUE). In this case I want that subworkflow1 will be killed, subworkflow2 will succeed, and I don't really care what my entire app will say. <ul> <li>In my case, subworkflow1 takes longer than subworkflow2, so when I checked my app when it ended, I saw that although it says that subworkflow1+2 were killed, and my app succeeded, what really happened is that subworkflow2 finished its part and even though, it was killed later (it keeps 'running' until all the paths of the fork finish their run). So workflow2 finished its part and than was killed because workflow1 was killed...</li> </ul> What should I do to make each path to get it's own status and continue running even though other path in the same fork is killed?

I have recently run into this issue also. Found a way to get oozie to behave how I want. Your forked actions can have an error-to value equal to your join name. This will skip any subsequent action in that particular forked execution path. Then, your join's "to" value can send control to a decision node. That decision node should check value of <code>wf:lastErrorNode()</code>. If the value is empty string, continue on processing the workflow as needed. If the value is not empty string, then an error occurred and your can send control to kill node. Here's an example: <pre class="prettyprint"><code><start to="forkMe"/> <fork name="forkMe"> <path start="action1"/> <path start="action2"/> </fork> <action name="action1"> ... <ok to="joinMe"/> <error to="joinMe"/> </action> <action name="action1"> ... <ok to="joinMe"/> <error to="joinMe"/> </action> <join name="joinMe" to="decisionMe"/> <decision name="decisionMe"> <switch> <case to="end"> ${wf:lastErrorNode() eq ""} </case> <default to="error-mail"/> </switch> </decision> <action name="error-mail"> ... <ok to="fail"/> <error to="fail"/> </action> <kill name="fail"> <message>Job failed: message[${wf:errorMessage(wf:lastErrorNode())}] </message> </kill> <end name="end"/> </code></pre>

Oozie fork kills all actions when one is killed

Tags:

join

fork

parallel-processing

workflow

oozie

I use fork/join in Oozie, in order to parallel some sub-workflow actions. My workflow.xml looks like this:

<workflow-app name="myName" xmlns="uri:oozie:workflow:0.5"
<start to="fork1"/>
<kill name="Kill">
    <message>Action failed, error message[${wf:errorMessage(wf:lastErrorNode())}]</message>
    </kill>

<fork name="fork1">
    <path start="subworkflow1"/>
    <path start="subworkflow2"/>
</fork>
<join name="Completed" to="End"

<action name="subworkflow1">
    <sub-workflow>
        <app-path>....</app-path>
        <propagate-configuration/>
        <configuration>
            <property>
                <name>....</name>
                <value>....</value>
            </property>
        </configuration>
    </sub-workflow>
    <ok to="Completed"/>
    <error to="Completed"/>
</action>

<action name="subworkflow2">
    <sub-workflow>
        <app-path>....</app-path>
        <propagate-configuration/>
        <configuration>
            <property>
                <name>....</name>
                <value>....</value>
            </property>
        </configuration>
    </sub-workflow>
    <ok to="Completed"/>
    <error to="Completed"/>
</action>

<end name="End"></workflow-app>

When subworkflow1 is killed (failed for some reason), It kills subworkflow2 also. I want those two actions to be parallel, but not dependent.

In my workflow, when workflow1 is killed, I see that workflow2 is also killed, but my app succeeded (I check it on Oozie dashboard -> workflows in HUE).

In this case I want that subworkflow1 will be killed, subworkflow2 will succeed, and I don't really care what my entire app will say.

In my case, subworkflow1 takes longer than subworkflow2, so when I checked my app when it ended, I saw that although it says that subworkflow1+2 were killed, and my app succeeded, what really happened is that subworkflow2 finished its part and even though, it was killed later (it keeps 'running' until all the paths of the fork finish their run). So workflow2 finished its part and than was killed because workflow1 was killed...

What should I do to make each path to get it's own status and continue running even though other path in the same fork is killed?

748

asked Jul 08 '15 12:07

Hila Shifrin

1 Answers

I have recently run into this issue also. Found a way to get oozie to behave how I want.

Your forked actions can have an error-to value equal to your join name. This will skip any subsequent action in that particular forked execution path. Then, your join's "to" value can send control to a decision node. That decision node should check value of wf:lastErrorNode(). If the value is empty string, continue on processing the workflow as needed. If the value is not empty string, then an error occurred and your can send control to kill node.

Here's an example:

<start to="forkMe"/>
<fork name="forkMe">
    <path start="action1"/>
    <path start="action2"/>
</fork>
<action name="action1">
    ...
    <ok to="joinMe"/>
    <error to="joinMe"/>
</action>
<action name="action1">
    ...
    <ok to="joinMe"/>
    <error to="joinMe"/>
</action>
<join name="joinMe" to="decisionMe"/>
<decision name="decisionMe">
  <switch>
     <case to="end">
        ${wf:lastErrorNode() eq ""}
     </case>
     <default to="error-mail"/>
 </switch>
</decision>
<action name="error-mail">
    ...
    <ok to="fail"/>
    <error to="fail"/>
</action>
<kill name="fail">
    <message>Job failed:
        message[${wf:errorMessage(wf:lastErrorNode())}]
    </message>
</kill>
<end name="end"/>

113

answered Oct 10 '22 06:10

Jeffrey B

Related questions
                            
                                MySQL is not using INDEX in subquery
                            
                                Replacing Subqueries with Joins in MySQL
                            
                                Rails: NameError: uninitialized constant on join table
                            
                                Condition on joined table faster than condition on reference
                            
                                Rails Join Model select joint columns
                            
                                Laravel modify Auth::user() query?
                            
                                MySQL get the nearest future date to given date, from the dates located in different table having Common ID
                            
                                Merging multiple rows into single row PostgreSQL
                            
                                spark worker with 32GB or more memory encountered a fatal error
                            
                                Why does 'HASH JOIN' or 'LOOP JOIN' improve this stored proc?
                            
                                Order of tables in join query
                            
                                How to combine results from multiple tables with different columns?
                            
                                Adding virtual columns to current table in Doctrine?
                            
                                Advanced SQL Select Query
                            
                                Converting a doubly-nested query to a JOIN statement, and other optimizations
                            
                                Multiple joins/merges with data.tables
                            
                                SQL join left get MAX(date)
                            
                                SQL select query using joins, group by and aggregate functions
                            
                                Join tables by date range [duplicate]
                            
                                mysql select/delete using join over four tables

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With