How to run concurrent jobs (actions) in Apache Spark using a single Spark context

The Apache Spark documentation says that "within each Spark application, multiple “jobs” (Spark actions) may be running concurrently if they were submitted by different threads". How can I achieve this concurrency for the following sample code?

    SparkConf conf = new SparkConf().setAppName("Simple_App");
    JavaSparkContext sc = new JavaSparkContext(conf);

    JavaRDD<String> file1 = sc.textFile("/path/to/test_doc1");
    JavaRDD<String> file2 = sc.textFile("/path/to/test_doc2");

    System.out.println(file1.count());
    System.out.println(file2.count());

These two jobs are independent and must run concurrently.
Thank You.

asked Feb 25 '15 06:02

Sporty

1 Answer

Submit each action from its own thread, for example with an ExecutorService. Try something like this:

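    // Imports needed at the top of the file
    import java.util.concurrent.Callable;
    import java.util.concurrent.ExecutorService;
    import java.util.concurrent.Executors;
    import java.util.concurrent.Future;
    import org.apache.spark.api.java.JavaRDD;
    import org.apache.spark.api.java.JavaSparkContext;

    // "local[2]" runs Spark locally with two worker threads
    final JavaSparkContext sc = new JavaSparkContext("local[2]", "Simple_App");
    ExecutorService executorService = Executors.newFixedThreadPool(2);
    // Submit job 1 from its own thread
    Future<Long> future1 = executorService.submit(new Callable<Long>() {
        @Override
        public Long call() throws Exception {
            JavaRDD<String> file1 = sc.textFile("/path/to/test_doc1");
            return file1.count();
        }
    });
    // Submit job 2 from a second thread
    Future<Long> future2 = executorService.submit(new Callable<Long>() {
        @Override
        public Long call() throws Exception {
            JavaRDD<String> file2 = sc.textFile("/path/to/test_doc2");
            return file2.count();
        }
    });
    // Block until job 1 finishes
    System.out.println("File1:" + future1.get());
    // Block until job 2 finishes
    System.out.println("File2:" + future2.get());
    // Release the thread pool and the Spark context
    executorService.shutdown();
    sc.stop();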
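
The same pattern can be wrapped in a small reusable helper. The sketch below is an illustration only, not part of Spark: the class name ConcurrentJobs and the method runConcurrently are hypothetical, and the two tasks in main are plain stand-ins so the example runs without a Spark installation. With a real context you would pass callables that invoke the Spark actions instead.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

class ConcurrentJobs {

    // Hypothetical helper (not a Spark API): runs the tasks on a pool with one
    // thread per task and returns the results in the order the tasks were given.
    static <T> List<T> runConcurrently(List<Callable<T>> tasks) {
        // newFixedThreadPool rejects a size of 0, so use at least one thread.
        ExecutorService pool = Executors.newFixedThreadPool(Math.max(1, tasks.size()));
        try {
            List<T> results = new ArrayList<>();
            // invokeAll blocks until every task has completed.
            for (Future<T> future : pool.invokeAll(tasks)) {
                results.add(future.get());
            }
            return results;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("Interrupted while waiting for jobs", e);
        } catch (ExecutionException e) {
            throw new IllegalStateException("A job failed", e.getCause());
        } finally {
            // Always release the worker threads so the JVM can exit.
            pool.shutdown();
        }
    }

    public static void main(String[] args) {
        List<Callable<Long>> jobs = new ArrayList<>();
        // Stand-ins for Spark actions. With a real JavaSparkContext sc these
        // would be, e.g., () -> sc.textFile("/path/to/test_doc1").count()
        jobs.add(() -> 10L);
        jobs.add(() -> 20L);
        System.out.println(runConcurrently(jobs)); // prints [10, 20]
    }
}
```

Because each callable runs on a different thread, any Spark action invoked inside it is submitted from that thread, which is the condition the documentation quote in the question describes.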

answered Oct 06 '22 08:10

G Quintana
