 

Spark streaming job fails after getting stopped by Driver

I have a Spark Streaming job that reads data from Kafka and performs some operations on it. I am running the job on a YARN cluster (Spark 1.4.1) that has two nodes, each with 16 GB of RAM and 16 cores.

I pass these configuration options to spark-submit:

--master yarn-cluster --num-executors 3 --driver-memory 4g --executor-memory 2g --executor-cores 3

The job returns this error and exits after running for a short while:

INFO yarn.ApplicationMaster: Final app status: FAILED, exitCode: 11,
(reason: Max number of executor failures reached)

.....

ERROR scheduler.ReceiverTracker: Deregistered receiver for stream 0:
Stopped by driver

Update:

These logs were found as well:

INFO yarn.YarnAllocator: Received 3 containers from YARN, launching executors on 3 of them.....

INFO yarn.ApplicationMaster$AMEndpoint: Driver terminated or disconnected! Shutting down.

....

INFO yarn.YarnAllocator: Received 2 containers from YARN, launching executors on 2 of them.

INFO yarn.ExecutorRunnable: Starting Executor Container.....

INFO yarn.ApplicationMaster$AMEndpoint: Driver terminated or disconnected! Shutting down...

INFO yarn.YarnAllocator: Completed container container_e10_1453801197604_0104_01_000006 (state: COMPLETE, exit status: 1)

INFO yarn.YarnAllocator: Container marked as failed: container_e10_1453801197604_0104_01_000006. Exit status: 1. Diagnostics: Exception from container-launch.
Container id: container_e10_1453801197604_0104_01_000006
Exit code: 1
Stack trace: ExitCodeException exitCode=1: 
    at org.apache.hadoop.util.Shell.runCommand(Shell.java:576)
    at org.apache.hadoop.util.Shell.run(Shell.java:487)
    at org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:753)
    at org.apache.hadoop.yarn.server.nodemanager.DefaultContainerExecutor.launchContainer(DefaultContainerExecutor.java:211)
    at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:302)
    at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:82)
    at java.util.concurrent.FutureTask.run(FutureTask.java:266)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
    at java.lang.Thread.run(Thread.java:745)

Container exited with a non-zero exit code 1

What might be the reasons for this? I'd appreciate some help.

Thanks

asked Feb 01 '16 by void

People also ask

What happens when Spark driver fails?

The driver node's purpose is to run the Spark Streaming application; however, if it fails, the SparkContext will be lost, and the executors will be unable to access any in-memory data.

Is Spark streaming deprecated?

Now that the Direct API of Spark Streaming (we currently have version 2.3.2) is deprecated, and we recently added the Confluent Platform (which comes with Kafka 2.2.0) to our project, we plan to migrate these applications.

Why does Spark streaming not offer fault tolerance for Dstreams?

Since Spark Streaming is built on Spark, it enjoys the same fault tolerance for worker nodes. However, the demand for high uptime in a Spark Streaming application requires that it also be able to recover from failures of the driver process, which is the main application process that coordinates all the workers.
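
Driver recovery is typically handled with checkpointing. Below is a minimal sketch (not taken from the question) of the getOrCreate pattern in the Java API; the checkpoint directory, app name, and batch interval are placeholder values. On YARN in cluster mode, the ApplicationMaster hosting the driver is restarted up to spark.yarn.maxAppAttempts times, and the new driver attempt can then rebuild the streaming context from the checkpoint.

    // Minimal sketch of checkpoint-based driver recovery (Java API).
    // The checkpoint path, app name, and batch interval are placeholders,
    // not values from the question.
    import org.apache.spark.SparkConf;
    import org.apache.spark.streaming.Durations;
    import org.apache.spark.streaming.api.java.JavaStreamingContext;

    public class RecoverableStreamingJob {
        public static void main(String[] args) {
            final String checkpointDir = "hdfs:///user/me/checkpoints/my-streaming-app";

            // On the first run this creates a fresh context; after a driver failure it
            // rebuilds the context (and the DStream lineage) from the checkpoint data.
            JavaStreamingContext jssc = JavaStreamingContext.getOrCreate(checkpointDir, () -> {
                SparkConf conf = new SparkConf().setAppName("ApplicationName");
                JavaStreamingContext newContext =
                        new JavaStreamingContext(conf, Durations.seconds(10));
                newContext.checkpoint(checkpointDir);
                // ... define the Kafka input stream and transformations here ...
                return newContext;
            });

            jssc.start();
            jssc.awaitTermination();
        }
    }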


1 Answer

Can you please show your Scala/Java code that reads from Kafka? I suspect you are probably not creating your SparkConf correctly.

Try something like

SparkConf sparkConf = new SparkConf().setAppName("ApplicationName");

Also try running the application in yarn-client mode and share the output.
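
For context, a minimal end-to-end sketch of a receiver-based Kafka consumer on Spark 1.4.1 (Java API) might look like the following; the ZooKeeper quorum, consumer group, topic name, and batch interval are placeholders rather than values from the question. setMaster() is deliberately omitted so that the master passed to spark-submit (yarn-cluster or yarn-client) is used.

    // Minimal sketch of a receiver-based Kafka consumer on Spark 1.4.1 (Java API).
    // The ZooKeeper quorum, consumer group, topic name, and batch interval are
    // placeholders -- they are not taken from the question.
    import java.util.Collections;
    import java.util.Map;

    import org.apache.spark.SparkConf;
    import org.apache.spark.streaming.Durations;
    import org.apache.spark.streaming.api.java.JavaPairReceiverInputDStream;
    import org.apache.spark.streaming.api.java.JavaStreamingContext;
    import org.apache.spark.streaming.kafka.KafkaUtils;

    public class KafkaStreamingSketch {
        public static void main(String[] args) {
            // No setMaster() here: the master (yarn-cluster / yarn-client) comes from spark-submit.
            SparkConf sparkConf = new SparkConf().setAppName("ApplicationName");
            JavaStreamingContext jssc =
                    new JavaStreamingContext(sparkConf, Durations.seconds(10));

            // Topic name -> number of receiver threads for that topic.
            Map<String, Integer> topics = Collections.singletonMap("my-topic", 1);
            JavaPairReceiverInputDStream<String, String> messages =
                    KafkaUtils.createStream(jssc, "zk-host:2181", "my-consumer-group", topics);

            // Example operation: print the message values of each batch.
            messages.map(record -> record._2()).print();

            jssc.start();
            jssc.awaitTermination();
        }
    }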

answered Sep 29 '22 by Faisal Ahmed Siddiqui