Why is Kotlin's map-filter-reduce slower than Java's Stream operations on large inputs?

Tags:

kotlin

A few days ago I created a simple benchmark (without jmh and all of the another specialized stuff, just to measure roughly).

I've found that for the same simple task (iterate through 10 million numbers, square them, filter only even numbers and reduce their sum), Java works much faster. Here's the code:

Kotlin:

fun test() {
    println((0 .. 10_000_000L).map { it * it }
                              .filter { it % 2 == 0L }
                              .reduce { sum, it -> sum + it })
}

Java:

public void test() {
    System.out.println(LongStream.range(0, 10_000_000)
                                 .map(it -> it * it)
                                 .filter(it -> it % 2 == 0)
                                 .reduce((sum, it) -> sum + it)
                                 .getAsLong());
}

I'm using Java version 1.8.0_144 and Kotlin version 1.2.

On my hardware in average it takes 85ms for Java and 4,470ms for Kotlin to execute the corresponding functions. Kotlin works 52 times slower.

I suspect that the Java compiler produces optimized bytecode, but I didn't expected to see such a huge difference. I'm wondering if I'm doing something wrong? How can I compel Kotlin to work faster? I like it because of its syntax, but 52 times is a big difference. And I just wrote Java 8-like code, not the plain old iterative version (which, I believe, will be much faster than given one).

778

asked Jan 18 '18 09:01

the_kaba

1 Answers

When you compare apples to oranges, the results don't tell you much. You compared one API to another API, each having a totally different focus and goals.

Since all of JDK is as much "Kotlin" as the Kotlin-specific additions, I wrote more of an apples-to-apples comparison, which also takes care of some of the "JVM microbenchmark" concerns.

Kotlin:

fun main(args: Array<String>) {
    println("Warming up Kotlin")
    test()
    test()
    test()
    println("Measuring Kotlin")
    val average = (1..10).map {
        measureTimeMillis { test() }
    }.average()
    println("An average Kotlin run took $average ms")
    println("(sum is $sum)")
}

var sum = 0L

fun test() {
    sum += LongStream.range(0L, 100_000_000L)
            .map { it * it }
            .filter { it % 2 == 0L }
            .reduce { sum, it -> sum + it }
            .asLong
}

Java:

public static void main(String[] args) {
    System.out.println("Warming up Java");
    test();
    test();
    test();
    System.out.println("Measuring Java");
    LongSummaryStatistics stats = LongStream.range(0, 10)
                                            .map(i -> measureTimeMillis(() -> test()))
                                            .summaryStatistics();
    System.out.println("An average Java run took " + stats.getAverage() + " ms");
    System.out.println("sum is " + sum);

}

private static long sum;

private static void test() {
    sum += LongStream.range(0, 100_000_000)
                     .map(it -> it * it)
                     .filter(it -> it % 2 == 0)
                     .reduce((sum, it) -> sum + it)
                     .getAsLong();
}

private static long measureTimeMillis(Runnable measured) {
    long start = System.nanoTime();
    measured.run();
    return TimeUnit.NANOSECONDS.toMillis(System.nanoTime() - start);
}

My results:

Warming up Kotlin
Measuring Kotlin
An average Kotlin run took 158.5 ms
(sum is 4276489111714942720)


Warming up Java
Measuring Java
An average Java run took 357.3 ms
sum is 4276489111714942720

Suprised? I was too.

Instead of digging further, trying to figure out this inversion of the expected results, I would like to make this conclusion:

Kotlin's FP extensions on Iterable are there for convenience. In 95% of all use cases you don't care whether it takes 1 or 2 µs to perform a quick map-filter on a list of 10-100 elements.

Java's Stream API is focused on the performance of bulk operations on large data structures. It also offers auto-parallelization towards the same goal (although it almost never actually helps), but its API is crippled and at times awkward due to these concerns. For example, many useful operations which don't happen to parallelize well are just not there, and the whole paradigm of non-terminal vs. terminal operations adds bulk to each and every Streams expression you write.

Let me also address a few more of your statements:

I know that the Java compiler produces optimized bytecode

This is a) not true and b) largely irrelevant because there is (almost) no such thing as "optimized bytecode". Interpreted execution of bytecode is always at least an order of magnitude slower than JIT-compiled native code.

And I just wrote Java 8-like code, not the plain old iterative version (which, I believe, will be much faster than given one).

You mean this?

Kotlin:

fun test() {
    var sum: Long = 0
    var i: Long = 0
    while (i < 100_000_000) {
        val j = i * i
        if (j % 2 == 0L) {
            sum += j
        }
        i++
    }
    total += sum
}

Java:

private static void test() {
    long sum = 0;
    for (long i = 0; i < 100_000_000; i++) {
        long j  = i * i;
        if (j % 2 == 0) {
            sum += j;
        }
    }
    total += sum;
}

These are the results:

Warming up Kotlin
Measuring Kotlin
An average Kotlin run took 150.1 ms
(sum is 4276489111714942720)

Warming up Java
Measuring Java
An average Java run took 153.0 ms
sum is 4276489111714942720

In both languages the performance is almost the same as Kotlin + Streams API above. As said, the Streams API is optimized for performance.

Both kotlinc and javac probably produced very similar bytecode given this straightforward source code, then HotSpot did its work on both the same way.

answered Sep 17 '22 05:09

Marko Topolnik

Related questions
                            
                                Cons'ing a List in Java
                            
                                question on the working of instanceof
                            
                                How to remove sub seconds part of Date object
                            
                                Performance of Java enums
                            
                                GridLayout and number of rows and columns
                            
                                Get generic type for java.util.Map parameter
                            
                                Concatenate Strings in the strings.xml file for Android
                            
                                how to sort an ArrayList in ascending order using Collections and Comparator
                            
                                Using table guava for hashbasedTable
                            
                                How to run a thread repeatedly after some interval
                            
                                Splitting a string on the double pipe(||) using String.split()
                            
                                How to have two constructors with same number of arguments but for different variables in java [closed]
                            
                                Java == behaving ambiguously
                            
                                Showing an Alert Dialog in Java Swing
                            
                                Access a private field for a junit test
                            
                                Instantiating objects when using Spring, for testing vs production
                            
                                Jersey Maven quickstart archetype in Eclipse
                            
                                How to add Section Header in ListView List Item
                            
                                JavaFX Scene Builder path in Ubuntu
                            
                                SQL Error: 1054, SQLState: 42S22 Unknown column in 'field list' error Java Spring Boot Mysql error

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Why is Kotlin's map-filter-reduce slower than Java's Stream operations on large inputs?

Tags:

java

kotlin

the_kaba

People also ask

1 Answers

Marko Topolnik

Recent Activity

Donate For Us