Spark executor logs on YARN

I'm launching a distributed Spark application in YARN client mode on a Cloudera cluster. After some time I see errors in Cloudera Manager: some executors get disconnected, and this happens systematically. I would like to debug the issue, but YARN does not report the underlying exception.

Exception from container-launch with container ID: container_1417503665765_0193_01_000003 and exit code: 1
ExitCodeException exitCode=1: 
    at org.apache.hadoop.util.Shell.runCommand(Shell.java:538)
    at org.apache.hadoop.util.Shell.run(Shell.java:455)
    at org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:702)
    at org.apache.hadoop.yarn.server.nodemanager.DefaultContainerExecutor.launchContainer(DefaultContainerExecutor.java:196)
    at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:299)
    at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:81)
    at java.util.concurrent.FutureTask.run(FutureTask.java:262)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
    at java.lang.Thread.run(Thread.java:745)

How can I see the stack trace of the exception? It seems that YARN reports only that the application exited abnormally. Is there a way to see the Spark executor logs through the YARN configuration?

asked Dec 06 '14 20:12

Nicola Ferraro

1 Answer

Check the NodeManager's yarn.nodemanager.log-dirs property. It is the location where container logs, including those of the Spark executors, are written while the containers are running.
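As a rough illustration, the sketch below reads yarn.nodemanager.log-dirs from the text of a yarn-site.xml file and builds the directories where a container's stdout and stderr would normally be found. The sample configuration, the /data/... paths, and the helper names are hypothetical; the per-application, per-container subdirectory layout is the usual NodeManager layout, but check it against your own cluster.

```python
import xml.etree.ElementTree as ET
from pathlib import PurePosixPath

# Hypothetical yarn-site.xml content; real values come from the NodeManager host.
SAMPLE_YARN_SITE = """<configuration>
  <property>
    <name>yarn.nodemanager.log-dirs</name>
    <value>/data/1/yarn/logs,/data/2/yarn/logs</value>
  </property>
</configuration>"""


def nodemanager_log_dirs(yarn_site_xml):
    """Return the directories listed in yarn.nodemanager.log-dirs (may be empty)."""
    root = ET.fromstring(yarn_site_xml)
    for prop in root.findall("property"):
        if prop.findtext("name") == "yarn.nodemanager.log-dirs":
            value = prop.findtext("value") or ""
            # The property holds a comma-separated list of local directories.
            return [d.strip() for d in value.split(",") if d.strip()]
    # Property not set in this file: the cluster default applies instead.
    return []


def container_log_dirs(log_dirs, application_id, container_id):
    """Candidate directories holding the container's stdout/stderr files."""
    return [PurePosixPath(d) / application_id / container_id for d in log_dirs]


if __name__ == "__main__":
    dirs = nodemanager_log_dirs(SAMPLE_YARN_SITE)
    for path in container_log_dirs(
        dirs,
        "application_1417503665765_0193",
        "container_1417503665765_0193_01_000003",
    ):
        print(path)
```

The container's log files live on the node that ran the container, so the resulting paths have to be checked on that host, not on the machine running the driver.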

Note that when the application finishes, the NodeManager may remove the local files if log aggregation is enabled. See this post for details: http://hortonworks.com/blog/simplifying-user-logs-management-and-access-in-yarn/
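If the logs have been aggregated, they can usually be fetched with the yarn logs command, which takes the application ID. A minimal sketch, assuming the container ID from the error message is all you have: the helper names are hypothetical, and the regular expression assumes the common container_[eNN_]<clusterTimestamp>_<appId>_<attempt>_<container> naming.

```python
import re

# Matches IDs such as container_1417503665765_0193_01_000003; the optional
# "eNN_" part covers the epoch prefix used by newer Hadoop releases.
CONTAINER_ID_RE = re.compile(r"^container_(?:e\d+_)?(\d+)_(\d+)_(\d+)_(\d+)$")


def application_id(container_id):
    """Derive the YARN application ID that a container ID belongs to."""
    match = CONTAINER_ID_RE.match(container_id)
    if match is None:
        raise ValueError(f"unrecognized container ID: {container_id!r}")
    cluster_timestamp, app_sequence = match.group(1), match.group(2)
    return f"application_{cluster_timestamp}_{app_sequence}"


def aggregated_logs_command(container_id):
    """Build the argument list for fetching the aggregated application logs."""
    return ["yarn", "logs", "-applicationId", application_id(container_id)]


if __name__ == "__main__":
    cmd = aggregated_logs_command("container_1417503665765_0193_01_000003")
    # Print the command instead of running it, since it needs a YARN client.
    print(" ".join(cmd))
```

The command has to be run after the application has finished and as a user allowed to read its logs; the output contains the stdout and stderr of every container, including the executor that failed.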

answered Sep 23 '22 05:09

2 revs, 2 users 80%


Tags: apache-spark, hadoop-yarn, cloudera, cloudera-manager
