Slurm: What is the difference for code executing under salloc vs srun

I'm using a cluster managed by Slurm to run some YARN/Hadoop benchmarks. To do this I start the Hadoop servers on nodes allocated by Slurm and then run the benchmarks on them. I realize that this is not the intended way to run a production Hadoop cluster, but needs must.

I started by writing a script that runs under srun, e.g. srun -N 4 setup.sh. This script writes the configuration files and starts the servers on the allocated nodes, with the lowest-numbered machine acting as the master. This all works, and I am able to run applications.
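For illustration, the "lowest-numbered machine is the master" rule could be sketched as below. This is not the actual setup.sh, which is not shown here; pick_master is a hypothetical helper name, and the sketch assumes hostnames are zero-padded so that a plain lexical sort matches numeric order. SLURM_JOB_NODELIST and scontrol show hostnames are standard Slurm facilities for expanding the compressed node list.

```shell
#!/bin/sh
# Hypothetical helper (not from the original setup.sh): read one hostname
# per line on stdin and print the lexically lowest one.
# Assumes zero-padded names (node03, node12), so lexical order == numeric order.
pick_master() {
    LC_ALL=C sort | head -n 1
}

# Inside a Slurm job, the node list is stored in compressed form
# (e.g. node[03-06]); "scontrol show hostnames" expands it to one name per line.
# The guard lets the script run harmlessly outside of Slurm.
if [ -n "${SLURM_JOB_NODELIST:-}" ] && command -v scontrol >/dev/null 2>&1; then
    master=$(scontrol show hostnames "$SLURM_JOB_NODELIST" | pick_master)
    echo "master: $master"
fi
```

Every task launched by srun sees the same node list, so each node can compute the same master independently without any coordination.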

However, I would like to start the servers once and then launch multiple applications on them, without restarting the servers or encoding every application into the setup script at the beginning, so I would like to use salloc instead. I had thought this would be a simple case of running salloc -N 4 and then running srun setup.sh. Unfortunately this does not work, as the different servers are unable to communicate with each other. Could anyone explain what the difference in the operating environment is between using srun directly and using salloc followed by srun?

Many thanks

Daniel

asked Mar 03 '14 16:03

Daniel Goodman

1 Answer

From the slurm-users mailing list:

sbatch and salloc allocate resources to the job, while srun launches parallel tasks across those resources. When invoked within a job allocation, srun will launch parallel tasks across some or all of the allocated resources. In that case, srun inherits by default the pertinent options of the sbatch or salloc which it runs under. You can then (usually) provide srun different options which will override what it receives by default. Each invocation of srun within a job is known as a job step.

srun can also be invoked outside of a job allocation. In that case, srun requests resources, and when those resources are granted, launches tasks across those resources as a single job and job step.
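One way to see concretely what a job step inherits in each case is to dump the Slurm-related environment under both launch modes and compare the results. The following is a minimal sketch rather than a definitive diagnostic: slurm_env and the file name dump_env.sh are hypothetical, and it only captures SLURM_* environment variables, not other differences such as network or working-directory setup.

```shell
#!/bin/sh
# dump_env.sh (hypothetical name): print every SLURM_* environment variable,
# sorted, one NAME=value per line, so two runs can be compared with diff.
slurm_env() {
    env | grep '^SLURM_' | LC_ALL=C sort
}

slurm_env

# Possible usage (the -l flag labels each output line with its task number):
#   Standalone job and job step in one go:
#     srun -N 4 -l ./dump_env.sh > standalone.txt
#   Job step inside an existing allocation:
#     salloc -N 4
#     srun -l ./dump_env.sh > in_alloc.txt
#   Then:
#     diff standalone.txt in_alloc.txt
```

Job and step IDs will differ between the two runs regardless; the lines worth looking at are those that describe node, task, and CPU layout, since those are the options srun takes from the enclosing allocation by default.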

answered Sep 23 '22 20:09

Prashant Singh

Tags: hadoop, hadoop-yarn, slurm
