 

How to implement custom job listener/tracker in Spark?

I have a class like the one below, and when I run it from the command line I want to see progress status, something like:

10% completed... 
30% completed... 
100% completed...Job done!

I am using Spark 1.0 on YARN with the Java API.

import java.util.Arrays;
import java.util.List;

import org.apache.spark.api.java.JavaPairRDD;
import org.apache.spark.api.java.JavaRDD;
import org.apache.spark.api.java.JavaSparkContext;
import org.apache.spark.api.java.function.FlatMapFunction;
import org.apache.spark.api.java.function.Function2;
import org.apache.spark.api.java.function.PairFunction;

import scala.Tuple2;

public class MyJavaWordCount {
    public static void main(String[] args) throws Exception {
        if (args.length < 2) {
            System.err.println("Usage: MyJavaWordCount <master> <file>");
            System.exit(1);
        }
        System.out.println("args[0]: <master>="+args[0]);
        System.out.println("args[1]: <file>="+args[1]);

        JavaSparkContext ctx = new JavaSparkContext(
                args[0],
                "MyJavaWordCount",
                System.getenv("SPARK_HOME"),
                System.getenv("SPARK_EXAMPLES_JAR"));
        JavaRDD<String> lines = ctx.textFile(args[1], 1);

        // Split each line into words: FlatMapFunction<input, output>.
        JavaRDD<String> words = lines.flatMap(new FlatMapFunction<String, String>() {
            public Iterable<String> call(String s) {
                return Arrays.asList(s.split(" "));
            }
        });

        // Map each word to a (word, 1) pair: PairFunction<input, K, V>.
        JavaPairRDD<String, Integer> ones = words.mapToPair(new PairFunction<String, String, Integer>() {
            public Tuple2<String, Integer> call(String s) {
                return new Tuple2<String, Integer>(s, 1);
            }
        });

        // Sum the counts for each word.
        JavaPairRDD<String, Integer> counts = ones.reduceByKey(new Function2<Integer, Integer, Integer>() {
            public Integer call(Integer i1, Integer i2) {
                return i1 + i2;
            }
        });

        List<Tuple2<String, Integer>> output = counts.collect();
        for (Tuple2<String, Integer> tuple : output) {
            System.out.println(tuple._1 + ": " + tuple._2);
        }
        System.exit(0);
    }
}


People also ask

How do I track my Spark job?

Click Analytics > Spark Analytics > Open the Spark Application Monitoring Page. Click Monitor > Workloads, and then click the Spark tab. This page displays the user names of the clusters that you are authorized to monitor and the number of applications that are currently running in each cluster.

What is SparkListener?

The Spark listener API allows developers to track events that Spark emits during application execution. Those events are typically application start/end, job start/end, stage start/end, etc. You can find the full list in the Spark JavaDoc. Spark listeners are easy to configure and easy to use for grabbing metrics.
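For illustration, a listener can also be registered through configuration rather than in code. This is only a sketch, assuming Spark 1.3 or later (which introduced the spark.extraListeners property); the class name com.example.MyListener is hypothetical:

import org.apache.spark.SparkConf;
import org.apache.spark.api.java.JavaSparkContext;

public class ListenerConfigExample {
    public static void main(String[] args) {
        // The listener class must have a zero-argument constructor
        // (or a constructor that takes a SparkConf).
        SparkConf conf = new SparkConf()
                .setAppName("MyJavaWordCount")
                .set("spark.extraListeners", "com.example.MyListener"); // hypothetical listener class
        JavaSparkContext ctx = new JavaSparkContext(conf);

        // ... run the job; scheduler events are also delivered to com.example.MyListener
        ctx.stop();
    }
}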

How do you cancel all jobs in Spark that have been scheduled?

You can use the following methods to cancel running Spark jobs: cancelJobGroup(String groupId) cancels active jobs for the specified group, and cancelAllJobs() cancels all jobs that have been scheduled or are running.
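For example, a minimal sketch with the Java API (the group id "wordcount-group" and the class name are just examples):

import org.apache.spark.SparkConf;
import org.apache.spark.api.java.JavaSparkContext;

public class CancelJobsExample {
    public static void main(String[] args) {
        JavaSparkContext ctx = new JavaSparkContext(
                new SparkConf().setAppName("CancelJobsExample").setMaster("local[*]"));

        // Tag all jobs submitted from this thread with a group id; the third
        // argument asks Spark to interrupt task threads on cancellation.
        ctx.setJobGroup("wordcount-group", "word count jobs", true);

        // ... trigger actions here; in practice the calls below would come from another thread
        ctx.cancelJobGroup("wordcount-group"); // cancel only the jobs in that group
        ctx.cancelAllJobs();                   // or cancel everything scheduled or running

        ctx.stop();
    }
}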

What is Spark context used for?

A SparkContext represents the connection to a Spark cluster, and can be used to create RDDs, accumulators and broadcast variables on that cluster. Only one SparkContext should be active per JVM.
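As a small sketch of that, assuming the Spark 2.x Java API (class and variable names are illustrative):

import java.util.Arrays;

import org.apache.spark.SparkConf;
import org.apache.spark.api.java.JavaSparkContext;
import org.apache.spark.broadcast.Broadcast;
import org.apache.spark.util.LongAccumulator;

public class ContextExample {
    public static void main(String[] args) {
        // Only one active context per JVM.
        JavaSparkContext ctx = new JavaSparkContext(
                new SparkConf().setAppName("ContextExample").setMaster("local[*]"));

        Broadcast<Integer> factor = ctx.broadcast(10);                 // broadcast variable
        LongAccumulator processed = ctx.sc().longAccumulator("rows");  // accumulator (Spark 2.x API)

        int sum = ctx.parallelize(Arrays.asList(1, 2, 3))              // RDD created from the context
                .map(x -> {
                    processed.add(1);
                    return x * factor.value();
                })
                .reduce(Integer::sum);

        System.out.println("sum=" + sum + ", rows=" + processed.value());
        ctx.stop();  // release the context so another one can be created
    }
}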


2 Answers

If you are using Spark with Scala, this code will help you add a Spark listener.

Create your SparkContext

val sc = new SparkContext(sparkConf)

Now you can add your Spark listener to the SparkContext:

import org.apache.spark.scheduler.{SparkListener, SparkListenerApplicationEnd, SparkListenerApplicationStart}

sc.addSparkListener(new SparkListener() {
  override def onApplicationStart(applicationStart: SparkListenerApplicationStart): Unit = {
    println("Spark ApplicationStart: " + applicationStart.appName)
  }

  override def onApplicationEnd(applicationEnd: SparkListenerApplicationEnd): Unit = {
    println("Spark ApplicationEnd: " + applicationEnd.time)
  }
})

The SparkListener interface in the Spark scheduler package documents the full list of events you can listen for.



You should implement SparkListener. Just override whatever events you are interested in (job/stage/task start/end events), then call sc.addSparkListener(myListener).

It does not give you a straight-up percentage-based progress tracker, but at least you can track that progress is being made and its rough rate. The difficulty comes from how unpredictable the number of Spark stages can be, and also how the running times of each stage can be vastly different. The progress within a stage should be more predictable.
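For example, here is a minimal sketch of such a listener in the Java API (assuming Spark 2.x or later, where SparkListener is an abstract class that Java code can extend; the class name and the percentage logic are illustrative, not a built-in Spark feature):

import java.util.concurrent.atomic.AtomicInteger;

import org.apache.spark.scheduler.SparkListener;
import org.apache.spark.scheduler.SparkListenerJobEnd;
import org.apache.spark.scheduler.SparkListenerStageSubmitted;
import org.apache.spark.scheduler.SparkListenerTaskEnd;

// Hypothetical listener: counts tasks as stages are submitted and prints a
// rough completion percentage every time a task finishes.
public class ProgressListener extends SparkListener {
    private final AtomicInteger totalTasks = new AtomicInteger();
    private final AtomicInteger finishedTasks = new AtomicInteger();

    @Override
    public void onStageSubmitted(SparkListenerStageSubmitted stageSubmitted) {
        // New stages can show up while the job runs, so the total is a moving target.
        totalTasks.addAndGet(stageSubmitted.stageInfo().numTasks());
    }

    @Override
    public void onTaskEnd(SparkListenerTaskEnd taskEnd) {
        int done = finishedTasks.incrementAndGet();
        int total = totalTasks.get();
        if (total > 0) {
            System.out.println((100 * done / total) + "% completed...");
        }
    }

    @Override
    public void onJobEnd(SparkListenerJobEnd jobEnd) {
        System.out.println("Job " + jobEnd.jobId() + " done!");
    }
}

You would register it right after creating the context, e.g. ctx.sc().addSparkListener(new ProgressListener()) when using a JavaSparkContext. Since the task total grows as new stages are submitted, the printed percentage is only a rough estimate, which matches the caveat above.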
