
Does the master node execute actual tasks in Spark?

Tags: apache-spark

My question may sound silly, but it has bothered me for a long time.

[Image: components of a distributed Spark application]

The picture above shows the components of a distributed Spark application. I think it indicates that the master node never executes actual tasks and only serves as the cluster manager. Is that true?

By the way, "tasks" here refers to user-submitted tasks.

asked Apr 09 '26 09:04 by chenzhongpu


1 Answer

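Yes, the master node executes the driver process and does not run tasks. Tasks run in executor processes on the worker nodes. The master node is rarely stressed in terms of CPU but, depending on how broadcast variables, accumulators and collect() are used, it may come under considerable memory pressure, because all three move data through the driver: broadcast values are shipped out from it, accumulator updates are merged on it, and collect() pulls the full result set back into its memory.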
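
The division of labour described above can be illustrated with a toy model. This is only a sketch, not Spark code: it uses Python's standard-library thread pool as a stand-in for executor processes, and the names `run_task`, `driver` and `num_executors` are hypothetical, chosen for illustration. It shows the driver scheduling work and gathering results while the tasks themselves run elsewhere.

```python
import threading
from concurrent.futures import ThreadPoolExecutor


def run_task(partition):
    """A 'task': runs on an executor thread, not on the driver thread."""
    return {
        "ran_on": threading.current_thread().name,
        "partial_sum": sum(partition),
    }


def driver(data, num_executors=2):
    """The 'driver': splits the data, schedules tasks, gathers the results."""
    driver_thread = threading.current_thread().name
    # One partition per executor, assigned round-robin.
    partitions = [data[i::num_executors] for i in range(num_executors)]
    with ThreadPoolExecutor(
        max_workers=num_executors, thread_name_prefix="executor"
    ) as pool:
        # Loosely analogous to collect(): every task result is pulled
        # back into the driver's memory.
        results = list(pool.map(run_task, partitions))
    total = sum(r["partial_sum"] for r in results)
    return driver_thread, results, total


driver_thread, results, total = driver(list(range(10)))
print("driver ran on:", driver_thread)
print("tasks ran on:", sorted({r["ran_on"] for r in results}))
print("total:", total)
```

In this model the driver does almost no computation, but it holds every result that comes back, which mirrors why the node hosting the driver tends to be limited by RAM rather than CPU.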

answered Apr 11 '26 03:04 by Sim


