I have one producer and many consumers. <ul> <li>the producer is fast and generating a lot of results</li> <li>tokens with the same value need to be processed sequentially</li> <li>tokens with different values must be processed in parallel</li> <li>creating new Runnables would be very expensive and also the production code could work with 100k of Tokens(in order to create a Runnable I have to pass to the constructor some complex to build objects) </li> </ul> Can I achieve the same results with a simpler algorithm? Nesting a syncronization block with a reentrant lock seems a bit unnatural. Are there any race conditions you might notice? Update: a second solution I found was working with 3 collections. One to cache the producer results, second a blocking queue and 3rd using a list to track in the tasks in progress. Again a bit to complicated. My version of code <pre class="prettyprint"><code>import java.util.*; import java.util.concurrent.*; import java.util.concurrent.locks.ReentrantLock; public class Main1 { static class Token { private int order; private String value; Token() { } Token(int o, String v) { order = o; value = v; } int getOrder() { return order; } String getValue() { return value; } } private final static BlockingQueue<Token> queue = new ArrayBlockingQueue<Token>(10); private final static ConcurrentMap<String, Object> locks = new ConcurrentHashMap<String, Object>(); private final static ReentrantLock reentrantLock = new ReentrantLock(); private final static Token STOP_TOKEN = new Token(); private final static List<String> lockList = Collections.synchronizedList(new ArrayList<String>()); public static void main(String[] args) { ExecutorService producerExecutor = Executors.newSingleThreadExecutor(); producerExecutor.submit(new Runnable() { public void run() { Random random = new Random(); try { for (int i = 1; i <= 100; i++) { Token token = new Token(i, String.valueOf(random.nextInt(1))); queue.put(token); } queue.put(STOP_TOKEN); }catch(InterruptedException e){ e.printStackTrace(); } } }); ExecutorService consumerExecutor = Executors.newFixedThreadPool(10); for(int i=1; i<=10;i++) { // creating to many runnable would be inefficient because of this complex not thread safe object final Object dependecy = new Object(); //new ComplexDependecy() consumerExecutor.submit(new Runnable() { public void run() { while(true) { try { //not in order Token token = queue.take(); if (token == STOP_TOKEN) { queue.add(STOP_TOKEN); return; } System.out.println("Task start" + Thread.currentThread().getId() + " order " + token.getOrder()); Random random = new Random(); Thread.sleep(random.nextInt(200)); //doLongRunningTask(dependecy) lockList.remove(token.getValue()); } catch (InterruptedException e) { e.printStackTrace(); } } }}); } }} </code></pre>

You can pre-create set of <code>Runnables</code> which will pick incoming tasks (tokens) and place them in queues according to their order value. As pointed out in comments, it's not guaranteed that tokens with different values will always execute in parallel (all in all, you are bounded, at least, by nr of physical cores in your box). However, it is guaranteed that tokens with same order will be executed in the order of arrival. Sample code: <pre class="prettyprint"><code>/** * Executor which ensures incoming tasks are executed in queues according to provided key (see {@link Task#getOrder()}). */ public class TasksOrderingExecutor { public interface Task extends Runnable { /** * @return ordering value which will be used to sequence tasks with the same value. * Tasks with different ordering values may be executed in parallel, but not guaranteed to. */ String getOrder(); } private static class Worker implements Runnable { private final LinkedBlockingQueue<Task> tasks = new LinkedBlockingQueue<>(); private volatile boolean stopped; void schedule(Task task) { tasks.add(task); } void stop() { stopped = true; } @Override public void run() { while (!stopped) { try { Task task = tasks.take(); task.run(); } catch (InterruptedException ie) { // perhaps, handle somehow } } } } private final Worker[] workers; private final ExecutorService executorService; /** * @param queuesNr nr of concurrent task queues */ public TasksOrderingExecutor(int queuesNr) { Preconditions.checkArgument(queuesNr >= 1, "queuesNr >= 1"); executorService = new ThreadPoolExecutor(queuesNr, queuesNr, 0, TimeUnit.SECONDS, new SynchronousQueue<>()); workers = new Worker[queuesNr]; for (int i = 0; i < queuesNr; i++) { Worker worker = new Worker(); executorService.submit(worker); workers[i] = worker; } } public void submit(Task task) { Worker worker = getWorker(task); worker.schedule(task); } public void stop() { for (Worker w : workers) w.stop(); executorService.shutdown(); } private Worker getWorker(Task task) { return workers[task.getOrder().hashCode() % workers.length]; } } </code></pre>

Both sequential and parallel processing

Tags:

java

synchronization

multithreading

locking

I have one producer and many consumers.

the producer is fast and generating a lot of results
tokens with the same value need to be processed sequentially
tokens with different values must be processed in parallel
creating new Runnables would be very expensive and also the production code could work with 100k of Tokens(in order to create a Runnable I have to pass to the constructor some complex to build objects)

Can I achieve the same results with a simpler algorithm? Nesting a syncronization block with a reentrant lock seems a bit unnatural. Are there any race conditions you might notice?

Update: a second solution I found was working with 3 collections. One to cache the producer results, second a blocking queue and 3rd using a list to track in the tasks in progress. Again a bit to complicated.

My version of code

import java.util.*;
import java.util.concurrent.*;
import java.util.concurrent.locks.ReentrantLock;

public class Main1 {
    static class Token {
        private int order;
        private String value;
        Token() {

        }
        Token(int o, String v) {
            order = o;
            value = v;
        }

        int getOrder() {
            return order;
        }

        String getValue() {
            return value;
        }
    }

    private final static BlockingQueue<Token> queue = new ArrayBlockingQueue<Token>(10);
    private final static ConcurrentMap<String, Object> locks = new ConcurrentHashMap<String, Object>();
    private final static ReentrantLock reentrantLock = new ReentrantLock();
    private final static Token STOP_TOKEN = new Token();
    private final static List<String> lockList = Collections.synchronizedList(new ArrayList<String>());

    public static void main(String[] args) {
        ExecutorService producerExecutor = Executors.newSingleThreadExecutor();
        producerExecutor.submit(new Runnable() {
            public void run() {
                Random random = new Random();
                    try {
                        for (int i = 1; i <= 100; i++) {
                            Token token = new Token(i, String.valueOf(random.nextInt(1)));

                            queue.put(token);
                        }

                        queue.put(STOP_TOKEN);
                    }catch(InterruptedException e){
                        e.printStackTrace();
                    }
                }
        });

        ExecutorService consumerExecutor = Executors.newFixedThreadPool(10);
        for(int i=1; i<=10;i++) {

            // creating to many runnable would be inefficient because of this complex not thread safe object
            final Object dependecy = new Object(); //new ComplexDependecy()
            consumerExecutor.submit(new Runnable() {
                public void run() {
                    while(true) {
                        try {
                            //not in order


                            Token token = queue.take();
                            if (token == STOP_TOKEN) {
                                queue.add(STOP_TOKEN);
                                return;
                            }


                            System.out.println("Task start" + Thread.currentThread().getId() + " order "  + token.getOrder());

                            Random random = new Random();
                            Thread.sleep(random.nextInt(200)); //doLongRunningTask(dependecy)
                            lockList.remove(token.getValue());

                        } catch (InterruptedException e) {
                            e.printStackTrace();
                        }
                    }
            }});

    }
}}

302

asked Jan 08 '16 19:01

johnlemon

2 Answers

You can pre-create set of Runnables which will pick incoming tasks (tokens) and place them in queues according to their order value.

As pointed out in comments, it's not guaranteed that tokens with different values will always execute in parallel (all in all, you are bounded, at least, by nr of physical cores in your box). However, it is guaranteed that tokens with same order will be executed in the order of arrival.

Sample code:

/**
 * Executor which ensures incoming tasks are executed in queues according to provided key (see {@link Task#getOrder()}).
 */
public class TasksOrderingExecutor {

    public interface Task extends Runnable {
        /**
         * @return ordering value which will be used to sequence tasks with the same value.<br>
         * Tasks with different ordering values <i>may</i> be executed in parallel, but not guaranteed to.
         */
        String getOrder();
    }

    private static class Worker implements Runnable {

        private final LinkedBlockingQueue<Task> tasks = new LinkedBlockingQueue<>();

        private volatile boolean stopped;

        void schedule(Task task) {
            tasks.add(task);
        }

        void stop() {
            stopped = true;
        }

        @Override
        public void run() {
            while (!stopped) {
                try {
                    Task task = tasks.take();
                    task.run();
                } catch (InterruptedException ie) {
                    // perhaps, handle somehow
                }
            }
        }
    }

    private final Worker[] workers;
    private final ExecutorService executorService;

    /**
     * @param queuesNr nr of concurrent task queues
     */
    public TasksOrderingExecutor(int queuesNr) {
        Preconditions.checkArgument(queuesNr >= 1, "queuesNr >= 1");
        executorService = new ThreadPoolExecutor(queuesNr, queuesNr, 0, TimeUnit.SECONDS, new SynchronousQueue<>());
        workers = new Worker[queuesNr];
        for (int i = 0; i < queuesNr; i++) {
            Worker worker = new Worker();
            executorService.submit(worker);
            workers[i] = worker;
        }
    }

    public void submit(Task task) {
        Worker worker = getWorker(task);
        worker.schedule(task);
    }

    public void stop() {
        for (Worker w : workers) w.stop();
        executorService.shutdown();
    }

    private Worker getWorker(Task task) {
        return workers[task.getOrder().hashCode() % workers.length];
    }
}

answered Oct 18 '22 07:10

Victor Sorokin

By the nature of your code, the only way to guarantee that the tokens with the same value are processed in serial manner is to wait for STOP_TOKEN to arrive.

You'll need single producer-single consumer setup, with consumer collecting and sorting the tokens by their value (into the Multimap, let say).

Only then you know which tokens can be process serially and which may be processed in parallel.

Anyway, I advise you to look at LMAX Disruptor, which offers very effective way for sharing data between threads.

It doesn't suffer from synchronization overhead as Executors as it is lock free (which may give you nice performance benefits, depending on the way how you process the data).

The solution using two Disruptors

// single thread for processing as there will be only on consumer
Disruptor<InEvent> inboundDisruptor = new Disruptor<>(InEvent::new, 32, Executors.newSingleThreadExecutor());

// outbound disruptor that uses 3 threads for event processing
Disruptor<OutEvent> outboundDisruptor = new Disruptor<>(OutEvent::new, 32, Executors.newFixedThreadPool(3));

inboundDisruptor.handleEventsWith(new InEventHandler(outboundDisruptor));

// setup 3 event handlers, doing round robin consuming, effectively processing OutEvents in 3 threads
outboundDisruptor.handleEventsWith(new OutEventHandler(0, 3, new Object()));
outboundDisruptor.handleEventsWith(new OutEventHandler(1, 3, new Object()));
outboundDisruptor.handleEventsWith(new OutEventHandler(2, 3, new Object()));

inboundDisruptor.start();
outboundDisruptor.start();

// publisher code
for (int i = 0; i < 10; i++) {
    inboundDisruptor.publishEvent(InEventTranslator.INSTANCE, new Token());
}

The event handler on the inbound disruptor just collects incoming tokens. When STOP token is received, it publishes the series of tokens to outbound disruptor for further processing:

public class InEventHandler implements EventHandler<InEvent> {

    private ListMultimap<String, Token> tokensByValue = ArrayListMultimap.create();
    private Disruptor<OutEvent> outboundDisruptor;

    public InEventHandler(Disruptor<OutEvent> outboundDisruptor) {
        this.outboundDisruptor = outboundDisruptor;
    }

    @Override
    public void onEvent(InEvent event, long sequence, boolean endOfBatch) throws Exception {
        if (event.token == STOP_TOKEN) {
            // publish indexed tokens to outbound disruptor for parallel processing
            tokensByValue.asMap().entrySet().stream().forEach(entry -> outboundDisruptor.publishEvent(OutEventTranslator.INSTANCE, entry.getValue()));
        } else {
            tokensByValue.put(event.token.value, event.token);
        }
    }
}

Outbound event handler processes tokens of the same value sequentially:

public class OutEventHandler implements EventHandler<OutEvent> {

    private final long order;
    private final long allHandlersCount;
    private Object yourComplexDependency;

    public OutEventHandler(long order, long allHandlersCount, Object yourComplexDependency) {
        this.order = order;
        this.allHandlersCount = allHandlersCount;
        this.yourComplexDependency = yourComplexDependency;
    }

    @Override
    public void onEvent(OutEvent event, long sequence, boolean endOfBatch) throws Exception {
        if (sequence % allHandlersCount != order ) {
            // round robin, do not consume every event to allow parallel processing
            return;
        }

        for (Token token : event.tokensToProcessSerially) {
            // do procesing of the token using your complex class
        }

    }
}

The rest of the required infrastructure (purpose described in the Disruptor docs):

public class InEventTranslator implements EventTranslatorOneArg<InEvent, Token> {

    public static final InEventTranslator INSTANCE = new InEventTranslator();

    @Override
    public void translateTo(InEvent event, long sequence, Token arg0) {
        event.token = arg0;
    }

}

public class OutEventTranslator implements EventTranslatorOneArg<OutEvent, Collection<Token>> {

    public static final OutEventTranslator INSTANCE = new OutEventTranslator();

    @Override
    public void translateTo(OutEvent event, long sequence, Collection<Token> tokens) {
        event.tokensToProcessSerially = tokens;
    }
}


public class InEvent {

    // Note that no synchronization is used here,
    // even though the field is used among multiple threads.
    // Memory barrier used by Disruptor guarantee changes are visible.
    public Token token;
}

public class OutEvent {
    // ... again, no locks.
    public Collection<Token> tokensToProcessSerially;

}

public class Token {
    String value;

}

answered Oct 18 '22 07:10

David Siro

Related questions
                            
                                Cardinal direction algorithm in Java
                            
                                logical OR operator vs bitwise OR operator
                            
                                How to generate ssh compatible id_rsa(.pub) from Java
                            
                                Complex queries with JPA Criteria builder
                            
                                How to display parent(s) methods/variables in Outline?
                            
                                In Java, are variables declared inside static methods themselves static?
                            
                                java why should equals method input parameter be Object
                            
                                Proper usage of Apache Commons Configuration
                            
                                Does Spring embedded database support different SQL dialects?
                            
                                Set DatePickerDialog title permanently
                            
                                How to do mouse hover using Selenium WebDriver in Firefox 19?
                            
                                ArrayList<> vs ArrayList<Integer>
                            
                                What is the use of Pair Class and why have the implemented here?
                            
                                From the day week number get the day name with Joda Time
                            
                                Project 'X' is missing required Java project: 'Y'
                            
                                Spring @Async ignored
                            
                                How to enable logging in jetty?
                            
                                Json String to Java Object Avro
                            
                                Problems Resteasy 3.09 CorsFilter
                            
                                Are Spring's MockMvc used for unit testing or integration testing?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With