 

How to properly wait for an Apache Spark Launcher job when launching it from another application?

I am trying to avoid a "while(true)" solution for waiting until my Apache Spark job is done, but without success.

I have a Spark application which is supposed to process some data and put the result into a database. I call it from my Spring service and would like to wait until the job is done.

Example:

Launcher with method:

// launcher is assumed to be a pre-configured org.apache.spark.launcher.SparkLauncher instance
@Override
public void run(UUID docId, String query) throws Exception {
    launcher.addAppArgs(docId.toString(), query);

    SparkAppHandle sparkAppHandle = launcher.startApplication();

    sparkAppHandle.addListener(new SparkAppHandle.Listener() {
        @Override
        public void stateChanged(SparkAppHandle handle) {
            System.out.println(handle.getState() + " new state");
        }

        @Override
        public void infoChanged(SparkAppHandle handle) {
            System.out.println(handle.getState() + " new state");
        }
    });

    System.out.println(sparkAppHandle.getState().toString());
}

How do I wait properly until the state of the handle is "FINISHED"?

asked Mar 28 '16 by Alex Aniska


People also ask

What happens when Spark job is submitted?

When a client submits Spark user application code, the driver implicitly converts the code containing transformations and actions into a logical directed acyclic graph (DAG).

How do I schedule a Spark job in production?

By default, Spark's scheduler runs jobs in FIFO fashion. Each job is divided into “stages” (e.g. map and reduce phases), and the first job gets priority on all available resources while its stages have tasks to launch, then the second job gets priority, etc.

How do I run a Spark job in local mode?

It is very simple: when no --master flag is passed to spark-shell, pyspark, spark-submit, or any other launch command, Spark runs in local mode. Alternatively, you can pass the --master option with local as its argument, which defaults to a single thread.
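
For illustration, here is a minimal sketch of the same idea expressed with SparkLauncher (the class this thread is about) rather than the command line; the jar path and main class below are placeholders, not taken from the question:

import org.apache.spark.launcher.SparkAppHandle;
import org.apache.spark.launcher.SparkLauncher;

public class LocalModeExample {
    public static void main(String[] args) throws Exception {
        // "local[*]" runs Spark in-process using all available cores;
        // plain "local" would use a single thread.
        SparkAppHandle handle = new SparkLauncher()
                .setAppResource("/path/to/your-spark-app.jar")   // placeholder jar
                .setMainClass("com.example.YourSparkJob")        // placeholder main class
                .setMaster("local[*]")
                .startApplication();

        System.out.println("Submitted, current state: " + handle.getState());
    }
}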


1 Answer

I am also using SparkLauncher from a Spring application. Here is a summary of the approach that I took (by following examples in the JavaDoc).

The @Service used to launch the job also implements SparkAppHandle.Listener and passes a reference to itself via .startApplication(), e.g.

import org.apache.spark.launcher.SparkAppHandle;
import org.apache.spark.launcher.SparkLauncher;
...
@Service
public class JobLauncher implements SparkAppHandle.Listener {
...
// sparkMaster, sparkDeployMode and sparkHome are configuration fields elided here
...
private SparkAppHandle launchJob(String mainClass, String[] args) throws Exception {

    String appResource = getAppResourceName();

    // Passing "this" to startApplication() registers this service as the listener
    SparkAppHandle handle = new SparkLauncher()
        .setAppResource(appResource)
        .addAppArgs(args)
        .setMainClass(mainClass)
        .setMaster(sparkMaster)
        .setDeployMode(sparkDeployMode)
        .setSparkHome(sparkHome)
        .setConf(SparkLauncher.DRIVER_MEMORY, "2g")
        .startApplication(this);

    LOG.info("Launched [" + mainClass + "] from [" + appResource + "] State [" + handle.getState() + "]");

    return handle;
}

/**
* Callback method for changes to the Spark Job
*/
@Override
public void infoChanged(SparkAppHandle handle) {

    LOG.info("Spark App Id [" + handle.getAppId() + "] Info Changed.  State [" + handle.getState() + "]");

}

/**
* Callback method for changes to the Spark Job's state
*/
@Override
public void stateChanged(SparkAppHandle handle) {

    LOG.info("Spark App Id [" + handle.getAppId() + "] State Changed. State [" + handle.getState() + "]");

}

Using this approach, one can take action when the state changes to "FAILED", "FINISHED" or "KILLED".
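
If the goal is to actually block the calling thread until the job reaches one of those terminal states, the same listener mechanism can be combined with a CountDownLatch. The following is a minimal sketch of that idea (class and method names are illustrative, not from the original code); SparkAppHandle.State.isFinal() returns true for FINISHED, FAILED and KILLED:

import java.util.concurrent.CountDownLatch;

import org.apache.spark.launcher.SparkAppHandle;
import org.apache.spark.launcher.SparkLauncher;

public class BlockingLauncher {

    public SparkAppHandle.State launchAndWait(SparkLauncher launcher) throws Exception {
        CountDownLatch done = new CountDownLatch(1);

        SparkAppHandle handle = launcher.startApplication(new SparkAppHandle.Listener() {
            @Override
            public void stateChanged(SparkAppHandle handle) {
                // Release the latch once the job is in a terminal state
                if (handle.getState().isFinal()) {
                    done.countDown();
                }
            }

            @Override
            public void infoChanged(SparkAppHandle handle) {
                // no action needed here
            }
        });

        done.await();   // blocks until the listener observes FINISHED, FAILED or KILLED
        return handle.getState();
    }
}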

I hope this information is helpful to you.

answered Sep 18 '22 by tegatai