What is the relationship between workers, worker instances, and executors?

In Spark Standalone mode, there are master and worker nodes.

Here are a few questions:

  1. Do 2 worker instances mean one worker node running 2 worker processes?
  2. Does every worker instance hold an executor for a specific application (which manages storage and tasks), or does one worker node hold one executor?
  3. Is there a flow chart explaining how Spark works at runtime, for example on a word count?
asked Jul 11 '14 11:07 by edwardsbean

People also ask

Are a worker and an executor the same?

The memory of a Spark cluster worker node is shared between HDFS, YARN and other daemons, and the executors for Spark applications. Each worker node hosts executors; an executor is a process launched on a worker node for a specific Spark application.

How many executors can a worker node have in Spark?

In Spark Standalone mode, there are a master node and worker nodes. Each worker can run multiple executors if enough CPU and memory are available.

How many executors can a worker have?

In a standalone cluster you will get one executor per worker unless you play with `spark.executor.cores` and the worker has enough cores to hold more than one executor. When I start an application with default settings, Spark greedily acquires as many cores and executors as the scheduler offers. (A configuration sketch appears in the answer below.)

What are Spark executors?

Executors in Spark are worker processes that run the individual tasks of a given Spark job. They are launched at the start of a Spark application, and as tasks complete, their results are sent back to the driver.


1 Answer

I suggest reading the Spark cluster docs first, but even more so this Cloudera blog post explaining these modes.

Your first question depends on what you mean by 'instances'. A node is a machine, and there is rarely a good reason to run more than one worker per machine (the SPARK_WORKER_INSTANCES setting in spark-env.sh controls this, and it defaults to 1). So two worker nodes typically means two machines, each running a single Spark worker.

Workers hold many executors, for many applications. One application has executors on many workers.
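
For concreteness, here is a minimal sketch of the application-side settings that govern executor sizing in standalone mode. The master URL and the resource numbers are placeholders, not recommendations, and assume a worker offering 8 cores and 16g of memory:

```scala
import org.apache.spark.{SparkConf, SparkContext}

// Hypothetical sizing: with 2 cores and 4g per executor, a worker
// offering 8 cores and 16g could host up to 4 executors of this
// application, subject to the spark.cores.max cap below.
val conf = new SparkConf()
  .setAppName("sizing-example")
  .setMaster("spark://master-host:7077") // placeholder master URL
  .set("spark.executor.cores", "2")      // cores per executor
  .set("spark.executor.memory", "4g")    // memory per executor
  .set("spark.cores.max", "6")           // total cores this app may take

val sc = new SparkContext(conf)
```

Without `spark.executor.cores`, the standalone master launches one executor per worker for the application and gives it as many cores as the worker offers.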

Your third question is not clear.
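
If it is asking what the runtime flow of a word count looks like, here is a minimal sketch (the HDFS paths are placeholders), with comments noting which part of the runtime does what:

```scala
import org.apache.spark.{SparkConf, SparkContext}

object WordCount {
  def main(args: Array[String]): Unit = {
    val sc = new SparkContext(new SparkConf().setAppName("word-count"))

    // The driver only records this lineage of transformations;
    // nothing executes yet.
    val counts = sc.textFile("hdfs:///input/text.txt") // placeholder path
      .flatMap(line => line.split(" "))
      .map(word => (word, 1))
      .reduceByKey(_ + _) // shuffle boundary: the job splits into two stages here

    // The action below triggers the job: the driver schedules tasks
    // stage by stage, executors on the workers run them, and the
    // final output is written out.
    counts.saveAsTextFile("hdfs:///output/counts") // placeholder path
  }
}
```

Each stage runs as a set of parallel tasks, one per partition, on whatever executors the application has been granted.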

answered Oct 13 '22 05:10 by Sean Owen