YARN applications cannot start when specifying YARN node labels

I'm trying to use YARN node labels to tag worker nodes, but when I run applications on YARN (Spark or a simple YARN app), those applications cannot start.

  • with Spark, when specifying --conf spark.yarn.am.nodeLabelExpression="my-label", the job cannot start (blocked on Submitted application [...], see details below).

  • with a YARN application (like distributedshell), when specifying -node_label_expression my-label, the application cannot start either

Here are the tests I have run so far.

YARN node labels setup

I'm using Google Dataproc to run my cluster (example: 4 workers, 2 of them on preemptible nodes). My goal is to force any YARN application master to run on a non-preemptible node; otherwise a preemptible node can be shut down at any time, making the application fail hard.

I'm creating the cluster using YARN properties (--properties) to enable node labels:

gcloud dataproc clusters create \
    my-dataproc-cluster \
    --project [PROJECT_ID] \
    --zone [ZONE] \
    --master-machine-type n1-standard-1 \
    --master-boot-disk-size 10 \
    --num-workers 2 \
    --worker-machine-type n1-standard-1 \
    --worker-boot-disk-size 10 \
    --num-preemptible-workers 2 \
    --properties 'yarn:yarn.node-labels.enabled=true,yarn:yarn.node-labels.fs-store.root-dir=/system/yarn/node-labels'
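
As a sanity check, one can SSH into the master node and verify that the properties were applied to yarn-site.xml (Dataproc stores the Hadoop configuration under /etc/hadoop/conf):

# On the master node, check that the node-labels properties were applied
grep -A 1 'yarn.node-labels' /etc/hadoop/conf/yarn-site.xml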

Versions of packaged Hadoop and Spark:

  • Hadoop version : 2.8.2
  • Spark version : 2.2.0

After that, I create a label (my-label) and assign it to the two non-preemptible workers:

yarn rmadmin -addToClusterNodeLabels "my-label(exclusive=false)"
yarn rmadmin -replaceLabelsOnNode "\
    [WORKER_0_NAME].c.[PROJECT_ID].internal=my-label \
    [WORKER_1_NAME].c.[PROJECT_ID].internal=my-label"
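
The labels and their node assignments can also be double-checked from the command line (the node ID below is [HOSTNAME]:[NM_PORT], as listed by yarn node -list):

# List all node labels known to the ResourceManager
yarn cluster --list-node-labels

# List the nodes, then show the labels attached to a specific one
yarn node -list
yarn node -status [WORKER_0_NAME].c.[PROJECT_ID].internal:[NM_PORT]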

I can see the created label in the YARN Web UI:

[Screenshot: the my-label label listed in the YARN Web UI]

Spark

When I run a simple example (SparkPi) without specifying any node label information:

spark-submit \
  --class org.apache.spark.examples.SparkPi \
  --master yarn \
  --deploy-mode client \
  /usr/lib/spark/examples/jars/spark-examples.jar \
  10

In the Scheduler tab of the YARN Web UI, I see the application launched on <DEFAULT_PARTITION>.root.default.

But when I run the job specifying spark.yarn.am.nodeLabelExpression to set the location of the Spark application master:

spark-submit \
    --class org.apache.spark.examples.SparkPi \
    --master yarn \
    --deploy-mode client \
    --conf spark.yarn.am.nodeLabelExpression="my-label" \
    /usr/lib/spark/examples/jars/spark-examples.jar \
    10

The job is not launched. In the YARN Web UI, I see:

  • YarnApplicationState: ACCEPTED: waiting for AM container to be allocated, launched and register with RM.
  • Diagnostics: Application is Activated, waiting for resources to be assigned for AM. Details : AM Partition = my-label ; Partition Resource = <memory:6144, vCores:2> ; Queue's Absolute capacity = 0.0 % ; Queue's Absolute used capacity = 0.0 % ; Queue's Absolute max capacity = 0.0 % ;

I suspect that the queue related to the label partition (not <DEFAULT_PARTITION>, the other one) does not have sufficient resources to run the job:

[Screenshot: the Spark job stuck in the ACCEPTED state in the Scheduler tab]

Here, Used Application Master Resources is <memory:1024, vCores:1>, but Max Application Master Resources is <memory:0, vCores:0>. Since a queue's AM resource limit on a partition is derived from the queue's configured capacity on that partition, and the diagnostics show the default queue's absolute capacity on my-label is 0.0 %, that explains why the application cannot start; but I can't figure out how to change this.
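
Side note: the default queue's state, capacities, and accessible node labels can also be inspected from the CLI, which is a quick way to compare the two partitions (the exact fields printed vary across Hadoop versions):

# Print the default queue's state, capacities and accessible node labels
yarn queue -status default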

I tried updating different parameters, without success:

yarn.scheduler.capacity.root.default.accessible-node-labels=my-label

Or increasing these properties:

yarn.scheduler.capacity.root.default.accessible-node-labels.my-label.capacity
yarn.scheduler.capacity.root.default.accessible-node-labels.my-label.maximum-capacity
yarn.scheduler.capacity.root.default.accessible-node-labels.my-label.maximum-am-resource-percent
yarn.scheduler.capacity.root.default.accessible-node-labels.my-label.user-limit-factor
yarn.scheduler.capacity.root.default.accessible-node-labels.my-label.minimum-user-limit-percent

but again without success.
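
For completeness, a typical way to apply such changes on Dataproc (the paths below are the Dataproc defaults) is to edit the scheduler configuration on the master node and ask the ResourceManager to reload it:

# Edit the scheduler configuration on the master node
sudo vi /etc/hadoop/conf/capacity-scheduler.xml

# Reload the queue configuration without restarting the ResourceManager
yarn rmadmin -refreshQueues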

YARN Application

The issue is the same when running a YARN application:

hadoop jar \
    /usr/lib/hadoop-yarn/hadoop-yarn-applications-distributedshell.jar \
    -shell_command "echo ok" \
    -jar /usr/lib/hadoop-yarn/hadoop-yarn-applications-distributedshell.jar \
    -queue default \
    -node_label_expression my-label

The application cannot start, and the logs keep repeating:

INFO distributedshell.Client: Got application report from ASM for, appId=6, clientToAMToken=null, appDiagnostics= Application is Activated, waiting for resources to be assigned for AM. Details : AM Partition = my-label ; Partition Resource = <memory:6144, vCores:2> ; Queue's Absolute capacity = 0.0 % ; Queue's Absolute used capacity = 0.0 % ; Queue's Absolute max capacity = 0.0 % ; , appMasterHost=N/A, appQueue=default, appMasterRpcPort=-1, appStartTime=1520354045946, yarnAppState=ACCEPTED, distributedFinalState=UNDEFINED, [...]

If I don't specify -node_label_expression my-label, the application starts on <DEFAULT_PARTITION>.root.default and succeeds.

Questions

  • Am I doing something wrong with the labels? I followed the official documentation and this guide
  • Is this a problem specific to Dataproc? The guides above seem to work in other environments
  • Do I need to create a specific queue and associate it with my label? Since I'm running a "one-shot" cluster for a single Spark job, I don't need dedicated queues; running jobs on the default root queue is not a problem for my use case

Thanks for your help.

asked Mar 07 '18 by norbjd



1 Answer

A Google engineer answered us (on a private issue we raised, not in the public issue tracker) and gave us a solution: specifying an initialization script at Dataproc cluster creation. I don't think the issue comes from Dataproc; this is basically just YARN configuration. The script sets the following properties in capacity-scheduler.xml, just after creating the node label (my-label):

<property>
  <name>yarn.scheduler.capacity.root.accessible-node-labels</name>
  <value>my-label</value>
</property>
<property>
  <name>yarn.scheduler.capacity.root.accessible-node-labels.my-label.capacity</name>
  <value>100</value>
</property>
<property>
  <name>yarn.scheduler.capacity.root.default.accessible-node-labels</name>
  <value>my-label</value>
</property>
<property>
  <name>yarn.scheduler.capacity.root.default.accessible-node-labels.my-label.capacity</name>
  <value>100</value>
</property>

According to the comment accompanying the script, this "set accessible-node-labels on both root (the root queue) and root.default (the default queue applications actually get run on)". The root.default part is what was missing in my tests. Capacity for both is set to 100.

Then, the YARN ResourceManager must be restarted (systemctl restart hadoop-yarn-resourcemanager.service) for the modifications to take effect.
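
For reference, an initialization action reproducing this fix could look like the sketch below. This is my own reconstruction, not the actual script Google provided; it assumes the bdconfig utility and the get_metadata_value helper shipped on Dataproc images, and that the node label itself is created separately (yarn rmadmin -addToClusterNodeLabels, as in the question):

#!/bin/bash
# Dataproc initialization action (sketch): make the my-label partition
# accessible to the root and root.default queues, then restart the RM.
set -euxo pipefail

ROLE=$(/usr/share/google/get_metadata_value attributes/dataproc-role)
if [[ "${ROLE}" == 'Master' ]]; then
  CONF=/etc/hadoop/conf/capacity-scheduler.xml
  for prop in \
      'yarn.scheduler.capacity.root.accessible-node-labels=my-label' \
      'yarn.scheduler.capacity.root.accessible-node-labels.my-label.capacity=100' \
      'yarn.scheduler.capacity.root.default.accessible-node-labels=my-label' \
      'yarn.scheduler.capacity.root.default.accessible-node-labels.my-label.capacity=100'; do
    # Write each property into capacity-scheduler.xml (overwrite if present)
    bdconfig set_property \
        --configuration_file "${CONF}" \
        --name "${prop%%=*}" \
        --value "${prop#*=}" \
        --clobber
  done
  systemctl restart hadoop-yarn-resourcemanager.service
fi

Such a script would be passed at cluster creation with the --initialization-actions flag of gcloud dataproc clusters create.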

After that, I was able to start the jobs that could not start in my question.

Hope this helps anyone hitting the same or similar issues.

answered Oct 24 '22 by norbjd