Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Forcing multiple threads to use multiple CPUs when they are available

I'm writing a Java program which uses a lot of CPU because of the nature of what it does. However, lots of it can run in parallel, and I have made my program multi-threaded. When I run it, it only seems to use one CPU until it needs more then it uses another CPU - is there anything I can do in Java to force different threads to run on different cores/CPUs?

like image 703
Nosrama Avatar asked Aug 03 '09 15:08

Nosrama


People also ask

Can a threads be run on multiple CPUs?

In short: yes, a thread can run on different cores.

Can multiple threads run on multiple cores?

In computer architecture, multithreading is the ability of a central processing unit (CPU) (or a single core in a multi-core processor) to provide multiple threads of execution concurrently, supported by the operating system. This approach differs from multiprocessing.

What allows CPU to handle multiple threads simultaneously?

Superscalar means executing multiple instructions at the same time while thread-level parallelism (TLP) executes instructions from multiple threads within one processor chip at the same time.

Why do we need multiple threads instead of multiple processes?

You'd prefer multiple threads over multiple processes for two reasons: Inter-thread communication (sharing data etc.) is significantly simpler to program than inter-process communication. Context switches between threads are faster than between processes.


1 Answers

There are two basic ways to multi-thread in Java. Each logical task you create with these methods should run on a fresh core when needed and available.

Method one: define a Runnable or Thread object (which can take a Runnable in the constructor) and start it running with the Thread.start() method. It will execute on whatever core the OS gives it -- generally the less loaded one.

Tutorial: Defining and Starting Threads

Method two: define objects implementing the Runnable (if they don't return values) or Callable (if they do) interface, which contain your processing code. Pass these as tasks to an ExecutorService from the java.util.concurrent package. The java.util.concurrent.Executors class has a bunch of methods to create standard, useful kinds of ExecutorServices. Link to Executors tutorial.

From personal experience, the Executors fixed & cached thread pools are very good, although you'll want to tweak thread counts. Runtime.getRuntime().availableProcessors() can be used at run-time to count available cores. You'll need to shut down thread pools when your application is done, otherwise the application won't exit because the ThreadPool threads stay running.

Getting good multicore performance is sometimes tricky, and full of gotchas:

  • Disk I/O slows down a LOT when run in parallel. Only one thread should do disk read/write at a time.
  • Synchronization of objects provides safety to multi-threaded operations, but slows down work.
  • If tasks are too trivial (small work bits, execute fast) the overhead of managing them in an ExecutorService costs more than you gain from multiple cores.
  • Creating new Thread objects is slow. The ExecutorServices will try to re-use existing threads if possible.
  • All sorts of crazy stuff can happen when multiple threads work on something. Keep your system simple and try to make tasks logically distinct and non-interacting.

One other problem: controlling work is hard! A good practice is to have one manager thread that creates and submits tasks, and then a couple working threads with work queues (using an ExecutorService).

I'm just touching on key points here -- multithreaded programming is considered one of the hardest programming subjects by many experts. It's non-intuitive, complex, and the abstractions are often weak.


Edit -- Example using ExecutorService:

public class TaskThreader {     class DoStuff implements Callable {        Object in;        public Object call(){          in = doStep1(in);          in = doStep2(in);          in = doStep3(in);           return in;        }        public DoStuff(Object input){           in = input;        }     }      public abstract Object doStep1(Object input);         public abstract Object doStep2(Object input);         public abstract Object doStep3(Object input);          public static void main(String[] args) throws Exception {         ExecutorService exec = Executors.newFixedThreadPool(Runtime.getRuntime().availableProcessors());         ArrayList<Callable> tasks = new ArrayList<Callable>();         for(Object input : inputs){            tasks.add(new DoStuff(input));         }         List<Future> results = exec.invokeAll(tasks);         exec.shutdown();         for(Future f : results) {            write(f.get());         }     } } 
like image 184
BobMcGee Avatar answered Sep 21 '22 05:09

BobMcGee