I'm executing some Spark (Scala) SQL code in spark-shell. I want to know which queue I am using and, if possible, how much memory and how many executors I am using, and how to optimize them.
From "Launching Spark on YARN" in the Spark docs: ensure that HADOOP_CONF_DIR or YARN_CONF_DIR points to the directory which contains the (client-side) configuration files for the Hadoop cluster. These configs are used to write to HDFS and connect to the YARN ResourceManager.
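A quick way to confirm the shell actually picked up the YARN configuration is to check the master from inside spark-shell (a minimal check, assuming a recent Spark shell where sc is predefined):

// Inside spark-shell: sc is the predefined SparkContext.
// If the YARN configs were found, the master is "yarn"
// (older versions may report "yarn-client" or "yarn-cluster").
println(sc.master)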
Open the ResourceManager UI and confirm which queues are configured. Log in to the cluster and submit the job to the desired queue. In the logs you can see the output from the Spark job. This way you can run Spark jobs in different queues.
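To answer the "which queue am I using" part directly from inside spark-shell, you can read the session's configuration (a sketch; the spark.yarn.queue key applies when running on YARN, and the queue falls back to "default" if none was set at launch):

// Read the YARN queue this application was submitted to.
// SparkConf.get(key, default) returns the default when the key is unset.
val queue = sc.getConf.get("spark.yarn.queue", "default")
println(s"Running in YARN queue: $queue")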
You can set the queue name, number of executors, executor memory, total number of cores, cores per executor, driver memory, etc. when you start spark-shell or spark-submit.
Here is how you can specify the parameters:
spark-shell --executor-memory 6G --executor-cores 5 --num-executors 20 --driver-memory 2G --queue $queue_name
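After launching with the flags above, you can verify they took effect from inside the shell, since spark-submit maps each flag to a configuration entry (a sketch assuming the standard mappings, e.g. --executor-memory to spark.executor.memory and --num-executors to spark.executor.instances):

// Inspect the resource settings of the running session.
val conf = sc.getConf
println(conf.get("spark.executor.memory", "1g"))    // expect 6G
println(conf.get("spark.executor.cores", "1"))      // expect 5
println(conf.get("spark.executor.instances", "2"))  // expect 20
println(conf.get("spark.driver.memory", "1g"))      // expect 2G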
You should calculate these parameters based on your cluster capacity, using the fat-executor vs. thin-executor trade-off (a few large executors per node vs. many small ones); see the sketch below.
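For example, here is one common way to size executors on a hypothetical cluster of 10 nodes with 16 cores and 64 GB RAM each (the numbers are illustrative, not a recommendation for your cluster):

// Hypothetical cluster: 10 nodes, 16 cores and 64 GB RAM per node.
val nodes = 10
val usableCoresPerNode = 16 - 1   // leave 1 core per node for OS/Hadoop daemons
val usableMemPerNodeGB = 64 - 1   // leave ~1 GB per node for the OS

// 5 cores per executor is a common middle ground between
// "fat" (one huge executor per node) and "thin" (one core per executor).
val coresPerExecutor = 5
val executorsPerNode = usableCoresPerNode / coresPerExecutor  // 3
val numExecutors = nodes * executorsPerNode - 1               // 29 (one slot left for the driver/AM)
// Deduct roughly 10% per executor for YARN's memory overhead.
val memPerExecutorGB = (usableMemPerNodeGB / executorsPerNode * 0.9).toInt  // 18

println(s"--num-executors $numExecutors --executor-cores $coresPerExecutor --executor-memory ${memPerExecutorGB}G")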
If you still want to check resource utilization, look at the YARN ResourceManager page or the Spark web UI.
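If you're not sure where the Spark web UI for your session lives (it usually runs on port 4040 of the driver), the context can tell you directly (assuming Spark 2.0 or later, where uiWebUrl is available):

// Print the URL of this application's web UI, if the UI is enabled.
sc.uiWebUrl.foreach(url => println(s"Spark UI: $url"))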