Election of new zookeeper leader shuts down the Spark Master

Tags: apache-zookeeper, apache-spark

I noticed that the Spark master becomes unresponsive when I kill the ZooKeeper leader (leader election for the Spark master is delegated to ZooKeeper). Below is the error log I see on the Spark master node. Do you have any suggestions for resolving this?

15/06/22 10:44:00 INFO ClientCnxn: Unable to read additional data from server sessionid 0x14dd82e22f70ef1, likely server has closed socket, closing socket connection and attempting reconnect
15/06/22 10:44:00 INFO ClientCnxn: Unable to read additional data from server sessionid 0x24dc5a319b40090, likely server has closed socket, closing socket connection and attempting reconnect
15/06/22 10:44:01 INFO ConnectionStateManager: State change: SUSPENDED
15/06/22 10:44:01 INFO ConnectionStateManager: State change: SUSPENDED
15/06/22 10:44:01 WARN ConnectionStateManager: There are no ConnectionStateListeners registered.
15/06/22 10:44:01 INFO ZooKeeperLeaderElectionAgent: We have lost leadership
15/06/22 10:44:01 ERROR Master: Leadership has been revoked -- master shutting down.

asked Jun 22 '15 17:06

Arad

1 Answer

This is the expected behaviour. You have to set up multiple masters and specify the ZooKeeper URL in spark-env.sh on every master:

SPARK_DAEMON_JAVA_OPTS="-Dspark.deploy.recoveryMode=ZOOKEEPER -Dspark.deploy.zookeeper.url=zk1:2181,zk2:2181"

Note that ZooKeeper relies on a quorum: the ZooKeeper cluster is only up while a majority of its servers are running, which is why you should run an odd number of ZooKeeper servers. Since Spark depends on ZooKeeper here, the Spark cluster will not be up until the ZooKeeper quorum is established.
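To make the quorum point concrete, here is a minimal sketch of the majority arithmetic. The function names quorum_size and tolerated_failures are made up for illustration and are not part of ZooKeeper or Spark.

```shell
#!/bin/sh
# Illustrative helpers only; these names are not ZooKeeper or Spark commands.

# Majority (quorum) size for an ensemble of n ZooKeeper servers.
quorum_size() {
    echo $(( $1 / 2 + 1 ))
}

# Number of servers that can fail while a majority is still running.
tolerated_failures() {
    echo $(( ($1 - 1) / 2 ))
}

for n in 2 3 4 5; do
    echo "servers=$n quorum=$(quorum_size "$n") tolerated_failures=$(tolerated_failures "$n")"
done
```

By this arithmetic, a two-server ensemble such as zk1,zk2 cannot lose any server without losing quorum, and four servers tolerate no more failures than three, which is the usual argument for odd ensemble sizes.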

When you set up two (or n) masters and bring down a ZooKeeper server, the current master goes down, a new master is elected, and all the worker nodes attach to the new master.

For this to work, you should have started your workers with the full list of masters:

./start-slave.sh spark://master1:port1,master2:port2
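The multi-master URL is a single spark:// prefix followed by comma-separated host:port pairs. As a rough sketch, the URL could be assembled from a host list like this; master_url is a hypothetical helper, and master1, master2 and port 7077 are placeholder values for your own masters.

```shell
#!/bin/sh
# Hypothetical helper: join host:port pairs into one standalone master URL.
# master1/master2 and port 7077 are placeholders, not values from the question.
master_url() {
    url="spark://$1"
    shift
    for hp in "$@"; do
        url="$url,$hp"
    done
    echo "$url"
}

MASTERS=$(master_url master1:7077 master2:7077)
echo "$MASTERS"

# Assumed usage on a worker node, mirroring the command above:
# ./start-slave.sh "$MASTERS"
```

Passing every master in the URL is what lets a worker find the newly elected master after a failover, instead of staying tied to the one that shut down.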

You have to wait 1-2 minutes before the failover becomes noticeable.

answered Sep 21 '22 14:09

Knight71
