One of the requirements in the workflow I am working on is to wait for some event to happen for a given amount of time; if it does not happen, the task should be marked as failed, but the downstream tasks should still be executed.
I am wondering if "all_done" means that all the dependency tasks are done, no matter whether they succeeded or not.
Airflow's default trigger rule is "all_success", which states that all of a task's directly upstream tasks must have succeeded before the task itself can be executed. In general, a trigger rule defines the condition under which a task gets triggered, and only one trigger rule can be specified per task.
https://airflow.apache.org/docs/apache-airflow/stable/concepts/dags.html#concepts-trigger-rules
all_done means all upstream tasks have finished. Maybe they succeeded, maybe not.
all_success means all upstream tasks have finished without error.
So your guess is correct.
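
For the requirement in the question, this maps naturally to a sensor with a timeout followed by a task using trigger_rule="all_done". Here is a minimal sketch assuming Airflow 2.x; the DAG id, task ids, and the marker file path are all hypothetical:

    from datetime import datetime

    from airflow import DAG
    from airflow.operators.bash import BashOperator
    from airflow.sensors.filesystem import FileSensor
    from airflow.utils.trigger_rule import TriggerRule

    with DAG(
        dag_id="wait_then_continue",  # hypothetical DAG id
        start_date=datetime(2021, 1, 1),
        schedule_interval=None,
        catchup=False,
    ):
        # Wait up to 10 minutes for the event. If the file never appears,
        # the sensor times out and the task is marked FAILED.
        wait_for_event = FileSensor(
            task_id="wait_for_event",
            filepath="/tmp/event_marker",  # hypothetical event marker
            poke_interval=30,
            timeout=600,
        )

        # all_done: run once the sensor has finished, whether it
        # succeeded or failed.
        proceed_anyway = BashOperator(
            task_id="proceed_anyway",
            bash_command="echo 'running regardless of the sensor outcome'",
            trigger_rule=TriggerRule.ALL_DONE,
        )

        wait_for_event >> proceed_anyway

If you would rather have the sensor end up SKIPPED instead of FAILED on timeout, sensors also accept soft_fail=True; note that all_done treats both outcomes the same way.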
SUMMARY
The tasks are "all done" if the count of SUCCESS, FAILED, UPSTREAM_FAILED, and SKIPPED upstream tasks is greater than or equal to the count of all upstream tasks.
(I am not sure why it would ever be greater; perhaps subdags do something weird to the counts.)
Tasks are "all success" if the count of upstream tasks and the count of successful upstream tasks are the same.
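
To make the two definitions concrete, here is the same arithmetic in plain Python, using hypothetical counts for a task with four upstream tasks:

    # Hypothetical counts for a task with four upstream tasks.
    upstream = 4  # len(task.upstream_task_ids)
    successes, skipped, failed, upstream_failed = 2, 1, 1, 0

    done = successes + skipped + failed + upstream_failed  # 4

    all_done = done >= upstream          # True: every upstream task finished
    all_success = successes == upstream  # False: only two of them succeeded

    print(all_done, all_success)  # True False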
DETAILS
The code for evaluating trigger rules is here: https://github.com/apache/incubator-airflow/blob/master/airflow/ti_deps/deps/trigger_rule_dep.py#L72
The following code runs the qry and unpacks the first row (the query is an aggregation that will only ever return one row anyway) into the following variables:

    successes, skipped, failed, upstream_failed, done = qry.first()
the "done" column in the query corresponds to this: func.count(TI.task_id)
in other words a count of all the tasks matching the filter. The filter specifies that it is counting only upstream tasks, from the current dag, from the current execution date and this:
TI.state.in_([ State.SUCCESS, State.FAILED, State.UPSTREAM_FAILED, State.SKIPPED])
So done
is a count of the upstream tasks with one of those 4 states.
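
To see that counting in action, here is a self-contained toy version of the query against an in-memory SQLite database. TI and State below are simplified stand-ins for Airflow's internals, and the filters on dag_id, execution_date, and the upstream task ids are omitted for brevity:

    from sqlalchemy import Column, String, case, create_engine, func
    from sqlalchemy.orm import Session, declarative_base

    Base = declarative_base()

    # A toy stand-in for Airflow's TaskInstance model, just enough to
    # reproduce the counting query described above.
    class TI(Base):
        __tablename__ = "task_instance"
        task_id = Column(String, primary_key=True)
        state = Column(String)

    # Toy stand-ins for the relevant Airflow State constants.
    class State:
        SUCCESS = "success"
        FAILED = "failed"
        UPSTREAM_FAILED = "upstream_failed"
        SKIPPED = "skipped"

    engine = create_engine("sqlite://")
    Base.metadata.create_all(engine)

    with Session(engine) as session:
        session.add_all([
            TI(task_id="a", state=State.SUCCESS),
            TI(task_id="b", state=State.FAILED),
            TI(task_id="c", state=State.SKIPPED),
        ])
        session.commit()

        successes, skipped, failed, upstream_failed, done = session.query(
            func.sum(case((TI.state == State.SUCCESS, 1), else_=0)),
            func.sum(case((TI.state == State.SKIPPED, 1), else_=0)),
            func.sum(case((TI.state == State.FAILED, 1), else_=0)),
            func.sum(case((TI.state == State.UPSTREAM_FAILED, 1), else_=0)),
            func.count(TI.task_id),  # this is the "done" column
        ).filter(
            TI.state.in_([State.SUCCESS, State.FAILED,
                          State.UPSTREAM_FAILED, State.SKIPPED]),
        ).first()

        print(successes, skipped, failed, upstream_failed, done)  # 1 1 1 0 3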
Later there is this code:

    upstream = len(task.upstream_task_ids)
    ...
    upstream_done = done >= upstream

And the actual trigger rule only fails on this:

    if not upstream_done
The code is fairly straightforward and the concept is intuitive:

    num_failures = upstream - successes
    if num_failures > 0:
        ...  # it fails
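
Putting the pieces together, the evaluation of these two rules amounts to something like the following condensed paraphrase of the linked source (not the verbatim Airflow code, which handles more rules and flags):

    def evaluate_trigger_rule(trigger_rule, successes, skipped, failed,
                              upstream_failed, done, upstream):
        """Condensed sketch of the all_done / all_success evaluation."""
        if trigger_rule == "all_done":
            # Satisfied once every upstream task has finished, in any state.
            return done >= upstream
        if trigger_rule == "all_success":
            # Any upstream task that did not succeed counts as a failure.
            num_failures = upstream - successes
            return num_failures == 0
        raise ValueError(f"unsupported trigger rule: {trigger_rule}")

    # Three upstream tasks, one of which failed:
    print(evaluate_trigger_rule("all_done", 2, 0, 1, 0, done=3, upstream=3))     # True
    print(evaluate_trigger_rule("all_success", 2, 0, 1, 0, done=3, upstream=3))  # False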