How many CPUs are needed before Erlang is faster than single-threaded Java [closed]

I am currently using Java and have read a lot about Erlang on the net. I have 2 big questions:

1. How much slower (if at all) will Erlang be than plain Java? I'm assuming from the shootout benchmarks on the net that Java is going to be faster (Erlang doesn't do that well). So, how many more CPUs am I going to need to make Erlang shine over single-threaded Java (in my particular situation, given below)?
2. After reading around about Erlang for a while, I've come across a number of comments/posts saying that most large Erlang systems contain a good amount of C/C++. Is this for speed reasons (my assumption) or something else? i.e. why is this required?

I have read about the number of processors in most machines going up and threading models being hard (I agree) but I am looking to find out when the "line" is going to be crossed so that I can change language/paradigm at the right time.

A bit of background/context:
I am working server-side on Java services which are very CPU-bound and easily made parallel. This is because a single incoming update (via TCP) typically triggers a change to multiple (100s of) outputs.

The calculations are typically pretty simple (few loops, just lots of arithmetic) and the inputs are coming in pretty fast (100/s).

Currently we are running on 4-CPU machines with multiple services on each (so multi-threading is pretty pointless, and Java seems to run faster without the synchronized blocks, etc. required to make it multi-threaded). There is now a strong push for speed and we have access to 24-processor machines (one per process if required), so I am wondering how best to proceed: massively multi-threaded Java, or something easier to code, like Erlang.

652

asked Jan 03 '10 23:01

DaveC

1 Answer

Since this is an arithmetic-heavy workload and you have already done the job of splitting the code into separate service processes, you wouldn't gain much from Erlang. Your job seems to fit Java comfortably. Erlang is good at tiny transactions, such as message switching or serving static or simple dynamic web pages. It is not innately good at enterprise number-crunching or database workloads.
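To illustrate why this kind of workload sits comfortably in Java, here is a minimal sketch of one update fanning out to a few hundred outputs. Everything in it is an assumption made for illustration: the class `FanOutSketch`, the methods `recompute` and `applyUpdate`, and the placeholder arithmetic are hypothetical, not your actual service code. The point is only that when each output depends on nothing but its own id and the update value, the work can be spread over cores without any synchronized blocks.

```java
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.IntStream;

// Hypothetical sketch: one incoming update fans out to many independent outputs.
class FanOutSketch {

    // Placeholder arithmetic standing in for the real per-output calculation.
    static double recompute(int outputId, double update) {
        double acc = 0.0;
        for (int i = 1; i <= 1_000; i++) {
            acc += (update * outputId) / i;
        }
        return acc;
    }

    // Recomputes every output for one update. Each output reads only its own
    // id and the update value, so there is no shared mutable state to lock.
    static List<Double> applyUpdate(double update, int outputCount) {
        return IntStream.range(0, outputCount)
                .parallel()                              // uses the common fork/join pool
                .mapToObj(id -> recompute(id, update))
                .collect(Collectors.toList());           // keeps results in id order
    }

    public static void main(String[] args) {
        List<Double> outputs = applyUpdate(1.5, 300);
        System.out.println("Recomputed " + outputs.size() + " outputs");
    }
}
```

Whether this beats a single thread depends on how much arithmetic each output really needs; for very cheap calculations the scheduling overhead can outweigh the gain, so it is worth measuring on the real workload.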

However, you could build on external numerical libraries and databases and use Erlang as a message switch :D That's what CouchDB does :P

-- edit --

If you move your arithmetic operations into an Erlang async-IO driver, Erlang will be just as good as the language shootout stuff, but with 24 CPUs perhaps it won't matter that much. The Erlang database is procedural and therefore quite fast; this can be exploited in your application, which updates 100 entities on each transaction.
The Erlang runtime system needs to be a mix of C and C++ because (a) the Erlang emulator is written in C/C++ (you have to start somewhere), (b) you have to talk to the kernel to do async file IO and network IO, and (c) certain parts of the system need to be blisteringly fast, e.g. the backend of the database system (Mnesia).

-- discussion --

With 24 CPUs in a 6-core * 4-CPU topology using a shared memory bus, you have 4 NUMA entities (the CPUs) and one central memory. You need to be wise about the paradigm: the shared-nothing multi-process approach might kill your memory bus.

To get around this you need to create 4 processes with 6 processing threads each and bind each processing thread to the corresponding core in the corresponding CPU. These 6 threads need to do collaborative multi-threading. Erlang and Lua have this innately; Erlang does it in a hard-core way, as it has a full-blown scheduler as part of its runtime which it can use to create as many processes as you want.

Now if you were to partition your tasks across the 4 processes (1 per physical CPU) you would be a happy man; however, you are running 4 Java VMs doing (presumably) serious work (yuck, for many reasons). What is needed is a better way to slice and dice the problem.
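The 4 x 6 partitioning described above can be roughly approximated inside a single JVM, as a sketch under stated assumptions. The class `PartitionedPoolsSketch`, its methods, and the placeholder arithmetic are hypothetical. Note the limit of the sketch: the standard Java API has no call for binding a thread to a core, so this only keeps each output on a fixed pool of 6 threads; actual CPU binding would have to come from outside the JVM (for example an OS-level tool such as `numactl` on Linux, applied per process).

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical sketch: 4 partitions (one per physical CPU) x 6 worker threads.
class PartitionedPoolsSketch {
    static final int PARTITIONS = 4;            // assumed: one per CPU socket
    static final int THREADS_PER_PARTITION = 6; // assumed: one per core

    // Maps an output id to the partition that always handles it, so the data
    // for a given output is only ever touched by one pool.
    static int partitionFor(int outputId) {
        return Math.floorMod(outputId, PARTITIONS);
    }

    // Placeholder arithmetic standing in for the real per-output calculation.
    static double recompute(int outputId, double update) {
        return outputId * update + 1.0;
    }

    // Pools are created per call only to keep the sketch short; a real
    // service would create them once and reuse them for every update.
    static double[] applyUpdate(double update, int outputCount) {
        List<ExecutorService> pools = new ArrayList<>();
        for (int p = 0; p < PARTITIONS; p++) {
            pools.add(Executors.newFixedThreadPool(THREADS_PER_PARTITION));
        }
        try {
            List<Future<Double>> futures = new ArrayList<>();
            for (int id = 0; id < outputCount; id++) {
                final int outputId = id;
                Callable<Double> task = () -> recompute(outputId, update);
                futures.add(pools.get(partitionFor(outputId)).submit(task));
            }
            double[] results = new double[outputCount];
            for (int id = 0; id < outputCount; id++) {
                results[id] = futures.get(id).get(); // wait for each result
            }
            return results;
        } catch (InterruptedException | ExecutionException e) {
            throw new IllegalStateException("update failed", e);
        } finally {
            for (ExecutorService pool : pools) {
                pool.shutdown();
            }
        }
    }

    public static void main(String[] args) {
        double[] results = applyUpdate(2.0, 100);
        System.out.println("Recomputed " + results.length + " outputs");
    }
}
```

Routing by output id rather than round-robin is the design choice that matters here: it keeps each output's working set with one group of threads, which is the property the NUMA argument relies on.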

In comes the Erlang OTP system. It was designed for redundant, robust networked systems, but it is now moving towards same-machine NUMA-esque CPUs. It already has a kick-ass SMP emulator, and it should become NUMA-aware soon as well. With this paradigm of programming you have a much better chance of saturating your powerful servers without killing your bus.

Perhaps this discussion has been theoretical; however, when you get an 8x8 or 16x8 topology you will be ready for it as well. So my answer is: when you have more than 2 modern physical CPUs on your mainboard, you should probably consider a better programming paradigm.

As an example of a major product following the discussion here: Microsoft's SQL Server is CPU-Level NUMA-aware in the SQL-OS layer on which the database engine is built.

199

answered Oct 21 '22 10:10

Hassan Syed

Tags: java, performance, multithreading, erlang, multicore
