
Run IO computations in parallel in Java8

Tags: java, java-8

I'm familiar with functional programming, mostly in Scala and JavaScript. I'm working on a Java 8 project and I'm not sure how I'm supposed to run through a list/stream of items, perform some side effect for each of them in parallel using a custom thread pool, and return an object on which it's possible to listen for completion (whether it's a success or a failure).

Currently I have the following code. It seems to work (I'm using the Play Framework Promise implementation as the return type), but it doesn't seem ideal because ForkJoinPool is not meant to be used for IO-intensive computations in the first place.

public static F.Promise<Void> performAllItemsBackup(Stream<Item> items) {
    ForkJoinPool pool = new ForkJoinPool(3);
    ForkJoinTask<F.Promise<Void>> result = pool
            .submit(() -> {
                try {
                    items.parallel().forEach(performSingleItemBackup);
                    return F.Promise.<Void>pure(null);
                } catch (Exception e) {
                    return F.Promise.<Void>throwing(e);
                }
            });

    try {
        return result.get();
    } catch (Exception e) {
        throw new RuntimeException("Unable to get result", e);
    }
}

Can someone give me a more idiomatic implementation of the above function? Ideally one that doesn't use ForkJoinPool, uses a more standard return type, and uses the most recent Java 8 APIs. I'm not sure which I'm supposed to use among CompletableFuture, CompletionStage, ForkJoinTask...


asked Jan 03 '18 11:01

Sebastien Lorber

1 Answer

A canonical solution would be

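public static CompletableFuture<Void> performAllItemsBackup(Stream<Item> items) {
    ForkJoinPool pool = new ForkJoinPool(3);
    try {
        // allOf completes once every per-item future has completed
        return CompletableFuture.allOf(
            // wrap each item so that an asynchronous action can be chained to it
            items.map(CompletableFuture::completedFuture)
                 // run the side effect on the custom pool, not the common pool
                 .map(f -> f.thenAcceptAsync(performSingleItemBackup, pool))
                 .toArray(CompletableFuture<?>[]::new));
    } finally {
        // already submitted tasks still run; the pool just accepts no new ones
        pool.shutdown();
    }
}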

Note that running a parallel stream from inside a ForkJoinPool task to make it use that pool relies on an unspecified implementation detail, so you should not depend on it. In contrast, CompletableFuture has a dedicated API for supplying an Executor, and it doesn’t even have to be a ForkJoinPool:

public static CompletableFuture<Void> performAllItemsBackup(Stream<Item> items) {
    ExecutorService pool = Executors.newFixedThreadPool(3);
    try {
        return CompletableFuture.allOf(
            items.map(CompletableFuture::completedFuture)
                 .map(f -> f.thenAcceptAsync(performSingleItemBackup, pool))
                 .toArray(CompletableFuture<?>[]::new));
    } finally {
        pool.shutdown();
    }
}
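
To see that the executor you pass in really is the one doing the work, here is a minimal, self-contained sketch. The class name ExecutorChoiceSketch, the method threadNamesUsed and the thread name prefix backup-io- are made up for illustration, and plain integers stand in for the Item objects of the question.

```java
import java.util.Set;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.stream.IntStream;

class ExecutorChoiceSketch {
    // Hypothetical helper: processes itemCount dummy items and reports
    // the names of the threads that executed the per-item action.
    static Set<String> threadNamesUsed(int itemCount) {
        AtomicInteger counter = new AtomicInteger();
        // Fixed pool of 3 threads with recognizable names (assumed prefix).
        ExecutorService pool = Executors.newFixedThreadPool(
            3, r -> new Thread(r, "backup-io-" + counter.incrementAndGet()));
        Set<String> names = ConcurrentHashMap.newKeySet();
        try {
            CompletableFuture.allOf(
                IntStream.range(0, itemCount).boxed()
                    .map(CompletableFuture::completedFuture)
                    // the second argument selects the executor for the action
                    .map(f -> f.thenAcceptAsync(
                        i -> names.add(Thread.currentThread().getName()), pool))
                    .toArray(CompletableFuture<?>[]::new))
                .join(); // block here only so the names can be returned
        } finally {
            pool.shutdown();
        }
        return names;
    }
}
```

Calling ExecutorChoiceSketch.threadNamesUsed(10) should yield at most three names, all starting with backup-io-, since the async variant with an explicit Executor never runs the action on the calling thread.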

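In either case, you should shut down the executor explicitly instead of relying on automatic cleanup. Calling shutdown() only stops the pool from accepting new tasks; the tasks that have already been submitted still run.

If you need an F.Promise<Void> result, you can use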
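
As a rough illustration of why the early shutdown() is safe, the following self-contained sketch uses the same pattern with a generic item type. BackupSketch and performAll are hypothetical names, and the Consumer parameter takes the place of performSingleItemBackup. Because toArray is a terminal operation, every action has been handed to the pool before the finally block runs.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.function.Consumer;
import java.util.stream.Stream;

class BackupSketch {
    // Runs the action for every item on a dedicated 3-thread pool and
    // returns a future that completes when all actions have finished.
    static <T> CompletableFuture<Void> performAll(Stream<T> items,
                                                  Consumer<? super T> action) {
        ExecutorService pool = Executors.newFixedThreadPool(3);
        try {
            return CompletableFuture.allOf(
                items.map(CompletableFuture::completedFuture)
                     .map(f -> f.thenAcceptAsync(action, pool))
                     .toArray(CompletableFuture<?>[]::new));
        } finally {
            // No new tasks are accepted from here on, but the actions
            // submitted above are still executed by the pool's threads.
            pool.shutdown();
        }
    }
}
```

If one of the actions throws, the returned future completes exceptionally, so join() on it throws a CompletionException whose cause is the original exception.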

public static F.Promise<Void> performAllItemsBackup(Stream<Item> items) {
    ExecutorService pool = Executors.newFixedThreadPool(3);
    try {
        return CompletableFuture.allOf(
            items.map(CompletableFuture::completedFuture)
                 .map(f -> f.thenAcceptAsync(performSingleItemBackup, pool))
                 .toArray(CompletableFuture<?>[]::new))
            .handle((v, e) -> e!=null? F.Promise.<Void>throwing(e): F.Promise.pure(v))
            .join();
    } finally {
        pool.shutdown();
    }
}

But note that this, like your original code, returns only once the operation has completed, whereas the methods returning a CompletableFuture let the operations run asynchronously; the caller blocks only if and when it invokes join or get.
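
With a CompletableFuture in hand, the caller does not have to block at all. A minimal sketch, assuming you just want to be told whether the backup succeeded or failed (CompletionListenerSketch and describeOutcome are hypothetical names, not part of any library):

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.CompletionException;

class CompletionListenerSketch {
    // Registers a callback instead of blocking with join() or get().
    // The returned future always completes normally with a status text.
    static CompletableFuture<String> describeOutcome(CompletableFuture<Void> backup) {
        return backup.handle((ignored, error) -> {
            if (error == null) {
                return "success";
            }
            // Dependent stages usually see the failure wrapped in a
            // CompletionException, so unwrap it when a cause is present.
            Throwable cause = error instanceof CompletionException
                    && error.getCause() != null ? error.getCause() : error;
            return "failure: " + cause.getMessage();
        });
    }
}
```

The result can then be consumed with thenAccept, for example to log the status, while the calling thread moves on to other work.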

To return a truly asynchronous Promise, you have to wrap the entire operation, e.g.

public static F.Promise<Void> performAllItemsBackup(Stream<Item> stream) {
    return F.Promise.pure(stream).flatMap(items -> {
        ExecutorService pool = Executors.newFixedThreadPool(3);
        try {
            return CompletableFuture.allOf(
                items.map(CompletableFuture::completedFuture)
                     .map(f -> f.thenAcceptAsync(performSingleItemBackup, pool))
                     .toArray(CompletableFuture<?>[]::new))
                .handle((v, e) -> e!=null? F.Promise.<Void>throwing(e): F.Promise.pure(v))
                .join();
        } finally {
            pool.shutdown();
        }
    });
}

But it’s better to settle on one API instead of jumping back and forth between two different ones.

answered Oct 19 '22 23:10

Holger
