How to fetch Spark Streaming job statistics using REST calls when running in yarn-cluster mode

Tags:

I have a spark streaming program running on Yarn Cluster in "yarn-cluster" mode. (-master yarn-cluster). I want to fetch spark job statistics using REST APIs in json format. I am able to fetch basic statistics using REST url call: http://yarn-cluster:8088/proxy/application_1446697245218_0091/metrics/json. But this is giving very basic statistics.

However I want to fetch per executor or per RDD based statistics. How to do that using REST calls and where I can find the exact REST url to get these statistics. Though $SPARK_HOME/conf/metrics.properties file sheds some light regarding urls i.e.

5. MetricsServlet is added by default as a sink in master, worker and client driver, you can send http request "/metrics/json" to get a snapshot of all the registered metrics in json format. For master, requests "/metrics/master/json" and "/metrics/applications/json" can be sent seperately to get metrics snapshot of instance master and applications. MetricsServlet may not be configured by self.

but that is fetching html pages not json. Only "/metrics/json" fetches stats in json format. On top of that knowing application_id pro-grammatically is a challenge in itself when running in yarn-cluster mode.

I checked REST API section of Spark Monitoring page, but that didn't worked when we run spark job in yarn-cluster mode. Any pointers/answers are welcomed.

527

asked Dec 29 '15 08:12

ramanKC

3 Answers

You should be able to access the Spark REST API using:

http://yarn-cluster:8088/proxy/application_1446697245218_0091/api/v1/applications/

From here you can select the app-id from the list and then use the following endpoint to get information about executors, for example:

http://yarn-cluster:8088/proxy/application_1446697245218_0091/api/v1/applications/{app-id}/executors

I verified this with my spark streaming application that is running in yarn cluster mode.

I'll explain how I arrived at the JSON response using a web browser. (This is for a Spark 1.5.2 streaming application in yarn-cluster mode).

First, use the hadoop url to view the RUNNING applications. http://{yarn-cluster}:8088/cluster/apps/RUNNING.

Next, select a running application, say http://{yarn-cluster}:8088/cluster/app/application_1450927949656_0021.

Next, click on the TrackingUrl link. This uses a proxy and the port is different in my case: http://{yarn-proxy}l:20888/proxy/application_1450927949656_0021/. This shows the spark UI. Now, append the api/v1/applications to this URL: http://{yarn-proxy}l:20888/proxy/application_1450927949656_0021/api/v1/applications.

You should see a JSON response with the application name supplied to SparkConf and the start time of the application.

answered Oct 13 '22 13:10

user5728085

I was able to reconstruct the metrics in the columns seen in the Spark Streaming web UI (batch start time, processing delay, scheduling delay) using the /jobs/ endpoint.

The script I used is available here. I wrote a short post describing and tying its functionality back to the Spark codebase. This does not need any web-scraping.

It works for Spark 2.0.0 and YARN 2.7.2, but may work for other version combinations too.

answered Oct 13 '22 13:10

Emaad Ahmed Manzoor

You'll need to scrape through the HTML page to get the relevant metrics. There isn't a Spark rest endpoint for capturing this info.

answered Oct 13 '22 13:10

Sachin

Related questions
                            
                                Is it possible to run spark yarn cluster from the code?
                            
                                how does YARN "Fair Scheduler" work with spark-submit configuration parameter
                            
                                Yarn get logs with rest API
                            
                                How YARN knows data locality in Apache spark in cluster mode
                            
                                How do I run Spark jobs concurrently in the same AWS EMR cluster ?
                            
                                "Can't get Kerberos realm" on yarn cluster
                            
                                Can sparklyr be used with spark deployed on yarn-managed hadoop cluster?
                            
                                Hadoop maps are failing due to ConnectException
                            
                                Spark coalesce relationship with number of executors and cores
                            
                                HADOOP YARN - Application is added to the scheduler and is not yet activated. Skipping AM assignment as cluster resource is empty
                            
                                Controling and monitorying number of simultaneous map/reduce tasks in YARN
                            
                                How can get memory and CPU usage of hadoop yarn application?
                            
                                Spark executor on yarn-client does not take executor core count configuration.
                            
                                What does container/resource allocation mean in Hadoop and in Spark when running on Yarn?
                            
                                How do Spark scheduler pools work when running on YARN?
                            
                                Mapreduce job fail when submitted from windows machine
                            
                                spark on yarn run double times when error [duplicate]
                            
                                Why would Spark executors be removed (with "ExecutorAllocationManager: Request to remove executorIds" in the logs)?
                            
                                What's the right way to use historyserver of hadoop 2.2?
                            
                                In Spark's client mode, the driver needs network access to remote executors?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to fetch Spark Streaming job statistics using REST calls when running in yarn-cluster mode

Tags:

hadoop-yarn

spark-streaming

ramanKC

People also ask

3 Answers

user5728085

Emaad Ahmed Manzoor

Sachin

Recent Activity

Donate For Us