
How to configure Apache Spark random worker ports for tight firewalls?

I am using Apache Spark to run machine-learning algorithms and other big-data tasks. Previously, I was running a standalone Spark cluster with the master and worker on the same machine. Now that I have added multiple worker machines, a tight firewall forces me to fix the workers' random ports. Can anyone explain how to change the random Spark ports and tell me exactly which configuration file needs to be edited? I read the Spark documentation and it says spark-defaults.conf should be configured, but I don't know how to configure this file, in particular to change the random ports used by Spark.

asked Jan 01 '15 by Isma Khan


People also ask

What is spark.port.maxRetries?

If the spark.port.maxRetries property is at its default (16), here is an example of its effect: if the Spark application web UI is enabled, which it is by default, no more than 17 Spark applications can run at the same time on one host, because the 18th Spark driver process will fail to bind to an application UI port.
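If you need to run more concurrent applications than that on a single host, the retry window can be widened. A minimal sketch, assuming spark-defaults.conf under SPARK_HOME/conf (the value 32 is purely illustrative):

   # spark-defaults.conf -- widen the port retry window (illustrative value)
   spark.port.maxRetries   32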

How do I change the default port for Spark?

By default, you can access the web UI for the master at port 8080. The port can be changed either in the configuration file or via command-line options. In addition, detailed log output for each job is also written to the work directory of each worker node (SPARK_HOME/work by default).
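For the standalone master's web UI specifically, a sketch of both options follows; the port 8090 is just an illustrative choice, not a recommendation:

   # conf/spark-env.sh -- move the master web UI off the default 8080
   export SPARK_MASTER_WEBUI_PORT=8090

   # or pass the port when starting the master from the command line
   ./sbin/start-master.sh --webui-port 8090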

How do I find my Spark driver port?

You don't need to log in to the Hadoop nodes to determine the port. The easiest way is to use the Resource Manager UI, but if you prefer the CLI you can use the yarn command: $ yarn application -status application_1493800575189_0014 . This will show you the tracking URL for the Spark driver.

How does Apache Spark communicate with the network?

Apache Spark makes heavy use of the network for communication between its various processes (illustrated in the source documentation by a figure titled "Network ports used in a typical Apache Spark environment"). The ports Spark uses are further described there in two tables, one for the cluster side and one for the driver side.

How to create a new Apache Spark configuration?

Select Manage > Apache Spark configurations. Click the New button to create a new Apache Spark configuration, or click Import a local .json file to import one into your workspace. The New Apache Spark configuration page opens after you click New. For Name, you can enter your preferred, valid name.

Why do I need to open a port in spark?

For instance, if your application developers need to access the Spark application web UI from outside the firewall, the application web UI port must be open on the firewall. Each time a Spark process is started, a number of listening ports are created that are specific to the intended function of that process.
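For example, on a driver host that uses firewalld, opening the default application web UI port might look like the sketch below; 4040 is only the default, and the actual port (plus the spark.port.maxRetries range above it) depends on your configuration:

   # illustrative firewalld commands -- adjust port and zone to your setup
   sudo firewall-cmd --permanent --add-port=4040/tcp
   sudo firewall-cmd --reload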

How secure is Apache Spark?

Spark Security: Things You Need To Know. Security in Spark is OFF by default. This could mean you are vulnerable to attack by default. Spark supports multiple deployments types and each one supports different levels of security. Not all deployment types will be secure in all environments and none are secure by default.


1 Answer

Update for Spark 2.x


Some libraries have been rewritten from scratch, and many legacy *.port properties are now obsolete (cf. SPARK-10997 / SPARK-20605 / SPARK-12588 / SPARK-17678 / etc.)

For Spark 2.1, for instance, the port ranges on which the driver will listen for executor traffic are

  • between spark.driver.port and spark.driver.port+spark.port.maxRetries
  • between spark.driver.blockManager.port and spark.driver.blockManager.port+spark.port.maxRetries

And the port range on which the executors will listen for driver traffic and/or other executors' traffic is

  • between spark.blockManager.port and spark.blockManager.port+spark.port.maxRetries

The "maxRetries" property allows for running several Spark jobs in parallel; if the base port is already used, then the new job will try the next one, etc, unless the whole range is already used.

Source:
   https://spark.apache.org/docs/2.1.1/configuration.html#networking
   https://spark.apache.org/docs/2.1.1/security.html under "Configuring ports"

answered Oct 02 '22 by Samson Scharfrichter