Conventional wisdom tells us that high-volume enterprise Java applications should use thread pooling in preference to spawning new worker threads. The use of java.util.concurrent makes this straightforward.
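For example, a minimal sketch of the pooled approach (class name and pool size are my own, purely illustrative):

import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

public class PooledExample {
    public static void main(String[] args) {
        // A fixed pool reuses a small set of pre-created worker threads
        // instead of spawning a new thread per task.
        ExecutorService pool = Executors.newFixedThreadPool(8);
        for (int i = 0; i < 100; i++) {
            final int requestId = i;
            pool.submit(new Runnable() {
                public void run() {
                    System.out.println("Handling request " + requestId
                            + " on " + Thread.currentThread().getName());
                }
            });
        }
        pool.shutdown();
    }
}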
There do exist situations, however, where thread pooling is not a good fit. The specific example I am currently wrestling with is the use of InheritableThreadLocal, which allows ThreadLocal variables to be "passed down" to any spawned threads. This mechanism breaks when using thread pools, since the worker threads are generally not spawned from the request thread, but are pre-existing.
Now there are ways around this (the thread locals can be explicitly passed in), but this isn't always appropriate or practical. The simplest solution is to spawn new worker threads on demand and let InheritableThreadLocal do its job.
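To make the trade-off concrete, here is a small sketch (class and variable names are mine) contrasting a freshly spawned thread, which inherits the parent's value, with a pooled worker, which only sees whatever was in scope when the pool thread was created:

import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

public class InheritanceSketch {
    // A child thread copies the parent's value at the moment the child is created.
    private static final InheritableThreadLocal<String> REQUEST_ID =
            new InheritableThreadLocal<String>();

    public static void main(String[] args) throws Exception {
        // Force the pool to create its single worker thread before any value is set.
        ExecutorService pool = Executors.newFixedThreadPool(1);
        pool.submit(new Runnable() { public void run() { } }).get();

        REQUEST_ID.set("req-42"); // set in the "request" (main) thread

        // A freshly spawned thread inherits the value.
        Thread spawned = new Thread(new Runnable() {
            public void run() {
                System.out.println("spawned thread sees: " + REQUEST_ID.get());
            }
        });
        spawned.start();
        spawned.join();

        // The pooled worker was created before set() was called.
        pool.submit(new Runnable() {
            public void run() {
                System.out.println("pooled thread sees: " + REQUEST_ID.get());
            }
        }).get();
        pool.shutdown();
    }
}

On my understanding of InheritableThreadLocal, the first line prints "req-42" and the second prints "null", which is exactly the breakage described above.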
This brings us back to the question: if I have a high-volume site, where user request threads are spawning off half a dozen worker threads each (i.e. not using a thread pool), is this going to give the JVM a problem? We're potentially talking about a couple of hundred new threads being created every second, each one lasting less than a second. Do modern JVMs optimize this well? I remember the days when object pooling was desirable in Java because object creation was expensive; this has since become unnecessary. I'm wondering if the same applies to thread pooling.
I'd benchmark it, if I knew what to measure, but my fear is that the problems may be more subtle than can be measured with a profiler.
Note: the wisdom of using thread locals is not the issue here, so please don't suggest that I not use them.
Creating a new thread only costs around 70 µs. This could be considered trivial in many, if not most, use cases. Relatively speaking it is still more expensive than the alternatives, and for some situations a thread pool, or not using threads at all, is the better solution.
Java thread creation is expensive because there is a fair bit of work involved: a large block of memory has to be allocated and initialized for the thread stack, and system calls need to be made to create and register the native thread with the host OS.
Multithreading also induces its own overhead, mainly caused by synchronization, spinning at user level, and NUMA management. The overhead is diverse in nature because it is a function of many system and workload properties. System-level solutions are feasible, but often imply difficult trade-offs.
Creating a thread is expensive, and each thread's stack requires memory. Also, if your process uses many threads, context switching can kill performance.
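As an aside on the stack cost: the per-thread stack is reserved when the thread is created, its default size can be changed JVM-wide with the -Xss option, and one Thread constructor accepts a stackSize argument that the JVM is free to treat as a hint. A hedged sketch (class and thread names are mine):

public class StackSizeSketch {
    public static void main(String[] args) {
        Runnable work = new Runnable() {
            public void run() {
                System.out.println("running on " + Thread.currentThread().getName());
            }
        };
        // Request a smaller stack for this particular thread; the JVM may ignore the hint.
        // The JVM-wide default can also be lowered with e.g. -Xss256k.
        Thread small = new Thread(null, work, "small-stack-worker", 64 * 1024);
        small.start();
    }
}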
Here is an example microbenchmark:
public class ThreadSpawningPerformanceTest {
    static long test(final int threadCount, final int workAmountPerThread) throws InterruptedException {
        Thread[] tt = new Thread[threadCount];
        final int[] aa = new int[tt.length];
        System.out.print("Creating " + tt.length + " Thread objects... ");
        long t0 = System.nanoTime(), t00 = t0;
        for (int i = 0; i < tt.length; i++) {
            final int j = i;
            tt[i] = new Thread() {
                public void run() {
                    int k = j;
                    for (int l = 0; l < workAmountPerThread; l++) {
                        k += k * k + l;
                    }
                    aa[j] = k;
                }
            };
        }
        System.out.println(" Done in " + (System.nanoTime() - t0) * 1E-6 + " ms.");

        System.out.print("Starting " + tt.length + " threads with " + workAmountPerThread + " steps of work per thread... ");
        t0 = System.nanoTime();
        for (int i = 0; i < tt.length; i++) {
            tt[i].start();
        }
        System.out.println(" Done in " + (System.nanoTime() - t0) * 1E-6 + " ms.");

        System.out.print("Joining " + tt.length + " threads... ");
        t0 = System.nanoTime();
        for (int i = 0; i < tt.length; i++) {
            tt[i].join();
        }
        System.out.println(" Done in " + (System.nanoTime() - t0) * 1E-6 + " ms.");

        long totalTime = System.nanoTime() - t00;
        int checkSum = 0; // display checksum so the JVM cannot optimize out the contents of run() and possibly even thread creation
        for (int a : aa) {
            checkSum += a;
        }
        System.out.println("Checksum: " + checkSum);
        System.out.println("Total time: " + totalTime * 1E-6 + " ms");
        System.out.println();
        return totalTime;
    }

    public static void main(String[] kr) throws InterruptedException {
        int workAmount = 100000000;
        int[] threadCount = new int[]{1, 2, 10, 100, 1000, 10000, 100000};
        int trialCount = 2;
        long[][] time = new long[threadCount.length][trialCount];
        for (int j = 0; j < trialCount; j++) {
            for (int i = 0; i < threadCount.length; i++) {
                time[i][j] = test(threadCount[i], workAmount / threadCount[i]);
            }
        }
        System.out.print("Number of threads ");
        for (long t : threadCount) {
            System.out.print("\t" + t);
        }
        System.out.println();
        for (int j = 0; j < trialCount; j++) {
            System.out.print((j + 1) + ". trial time (ms)");
            for (int i = 0; i < threadCount.length; i++) {
                System.out.print("\t" + Math.round(time[i][j] * 1E-6));
            }
            System.out.println();
        }
    }
}
The results on 64-bit Windows 7 with Sun's 32-bit Java 1.6.0_21 Client VM on an Intel Core 2 Duo E6400 @ 2.13 GHz are as follows:
Number of threads       1    2    10   100   1000   10000   100000
1. trial time (ms)    346  181   179   191    286    1229    11308
2. trial time (ms)    346  181   187   189    281    1224    10651
Conclusions: Two threads do the work almost twice as fast as one, as expected, since my computer has two cores. My computer can spawn nearly 10000 threads per second (the 100000-thread trial takes roughly 11 seconds), i.e. thread creation overhead is about 0.1 milliseconds per thread. Hence, on such a machine, a couple of hundred new threads per second pose a negligible overhead (as can also be seen by comparing the numbers in the columns for 2 and 100 threads).