How to use JMH properly? Example with ArrayList

Tags:

java

performance

jmh

In theory, the performance of the two methods in my example should be pretty similar. The first uses an array, the second an ArrayList with ensured capacity.

The results are as follows:

LessonBenchmark2.capacityTestArray avgt 5 1,354 ± 0,057 ms/op

LessonBenchmark2.capacityTestArrayListEnsured avgt 5 32,018 ± 81,911 ms/op

Here the array seems much faster (1.354 vs 32.018 ms/op). Perhaps my JMH benchmark settings are not correct. How do I set it up properly?

Also, if I use @Setup(Level.Invocation), the results are close (1.405 vs 1.496 ms/op):

LessonBenchmark.capacityTestArray avgt 5 1,405 ± 0,143 ms/op

LessonBenchmark.capacityTestArrayListEnsured avgt 5 1,496 ± 0,104 ms/op

However, the documentation says to use Level.Invocation with care, and Level.Iteration seems logically right.

Here is the code:

public static void main(String[] args) throws Exception {
    org.openjdk.jmh.Main.main(args);
}

static final int iter = 5;
static final int fork = 1;
static final int warmIter = 5;

@State(Scope.Benchmark)
public static class Params {
    public int length = 100_000;
    public Person[] people;
    public ArrayList<Person> peopleArrayListEnsure;

    // before each iteration of the benchmark
    @Setup(Level.Iteration)
    public void setup() {
        people = new Person[length];
        peopleArrayListEnsure = new ArrayList<>(length);
    }
}

@Benchmark
@Warmup(iterations = warmIter)
@BenchmarkMode(Mode.AverageTime)
@OutputTimeUnit(TimeUnit.MILLISECONDS)
@Fork(value = fork)
@Measurement(iterations = iter)
public void capacityTestArray(Params p) {
    for (int i = 0; i < p.length; i++) {
        p.people[i] = new Person(i, new Address(i, i), new Pet(i, i));
    }
}

@Benchmark
@Warmup(iterations = warmIter)
@BenchmarkMode(Mode.AverageTime)
@OutputTimeUnit(TimeUnit.MILLISECONDS)
@Fork(value = fork)
@Measurement(iterations = iter)
public void capacityTestArrayListEnsured(Params p) {
    for (int i = 0; i < p.length; i++) {
        p.peopleArrayListEnsure.add(new Person(i, new Address(i, i), new Pet(i, i)));
    }
}

public static class Person {
    private int id;
    private Address address;
    private Pet pet;

    public Person(int id, Address address, Pet pet) {
        this.id = id;
        this.address = address;
        this.pet = pet;
    }
}

public static class Address {
    private int countryId;
    private int cityId;

    public Address(int countryId, int cityId) {
        this.countryId = countryId;
        this.cityId = cityId;
    }
}

public static class Pet {
    private int age;
    private int typeId;

    public Pet(int age, int typeId) {
        this.age = age;
        this.typeId = typeId;
    }
}


asked Feb 17 '21 13:02

Kirill Ch

3 Answers

The test is badly designed. Because the ArrayList is created only once for many invocations, the array-based code just overwrites the same array over and over, whereas the ArrayList version keeps adding elements and has to grow.

One trivial fix is to clear the list first. Another is to stop using state here and make the creation of the container (be it the 100k-person array or the ArrayList presized for 100k persons) part of the benchmark method itself. Once you take care of this, the results are the same within the margin of error; there is no performance difference at all between arrays and ArrayLists for this.

MyBenchmark.capacityTestArray             avgt    5  1,325 ± 0,059  ms/op
MyBenchmark.capacityTestArrayListEnsured  avgt    5  1,287 ± 0,157  ms/op

I simplified by removing the Params state entirely and making the creation of the list and array part of each test method:

    static final int LEN = 100_000;
    
    public void capacityTestArray() {
        Person[] people = new Person[LEN];
        for (int i = 0; i < LEN; i++) {
            people[i] = new Person(i, new Address(i, i), new Pet(i, i));
        }
    }

    public void capacityTestArrayListEnsured() {
        List<Person> p = new ArrayList<Person>(LEN);
        for (int i = 0; i < LEN; i++) {
            p.add(new Person(i, new Address(i, i), new Pet(i, i)));
        }
    }

(keeping all annotations and the Person, Address, etc. classes the same).

Alternatively, take your existing code and just call list.clear() at the top of the benchmark method.
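As a rough sketch of that clear() alternative: the class below is plain Java without JMH, the names ClearBeforeFill, shared and fill are made up for illustration, and Integer stands in for Person to keep it short. It only shows that the list no longer accumulates elements from one call to the next.

```java
import java.util.ArrayList;
import java.util.List;

class ClearBeforeFill {
    static final int LEN = 100_000;

    // Hypothetical stand-in for the question's Params.peopleArrayListEnsure.
    static final List<Integer> shared = new ArrayList<>(LEN);

    // Stand-in for the benchmark body; Integer replaces Person for brevity.
    static int fill() {
        shared.clear(); // drop the elements left over from the previous call
        for (int i = 0; i < LEN; i++) {
            shared.add(i);
        }
        return shared.size();
    }

    public static void main(String[] args) {
        for (int call = 0; call < 3; call++) {
            System.out.println("size after call " + call + ": " + fill());
        }
    }
}
```

Every call ends with the same size, so the list never has to grow beyond its initial capacity.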

answered Oct 24 '22 10:10

rzwitserloot

Once you understand the difference between Trial, Iteration and Invocation, your question becomes very easy to answer. And what better place to understand these than the JMH samples themselves?

An Invocation is a single execution of the benchmark method. Say there are 3 threads and each executes the benchmark method 100 times: that makes 300 invocations. With Level.Invocation the setup runs before every one of them, which is why you get very similar results with that setting.

Iteration would be 3 in the example above.

Trial would be 1: it covers all the threads executing all of their invocations.
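To make the nesting of the three levels concrete, here is a small plain-Java model (not JMH itself). It assumes a single thread, a shared state object, and fixed, made-up counts; a real JMH iteration runs as many invocations as fit in its time window, and warmup iterations are ignored here. The class and field names are hypothetical.

```java
class SetupLevelCounter {
    // Hypothetical run shape, chosen only for illustration.
    static final int ITERATIONS = 5;
    static final int INVOCATIONS_PER_ITERATION = 100;

    int trialSetups;
    int iterationSetups;
    int invocationSetups;

    void run() {
        trialSetups++; // where a Level.Trial setup would run
        for (int it = 0; it < ITERATIONS; it++) {
            iterationSetups++; // where a Level.Iteration setup would run
            for (int inv = 0; inv < INVOCATIONS_PER_ITERATION; inv++) {
                invocationSetups++; // where a Level.Invocation setup would run
                // the benchmark method body would execute here
            }
        }
    }

    public static void main(String[] args) {
        SetupLevelCounter counter = new SetupLevelCounter();
        counter.run();
        System.out.println("trial setups:      " + counter.trialSetups);
        System.out.println("iteration setups:  " + counter.iterationSetups);
        System.out.println("invocation setups: " + counter.invocationSetups);
    }
}
```

The point of the model is the ratio: state prepared at the iteration level is shared by every invocation inside that iteration.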

Level.Invocation, despite its scary documentation, has its uses, for example with a sorted data structure; I've used it in various other places too. Also, the notion of an operation can be "altered" with @OperationsPerInvocation, which is another sharp tool.

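Armed with this, the question is easy to answer. With Level.Iteration, the setup runs only once per iteration while the benchmark method is invoked many times, so your ArrayList grows constantly (which internally means System.arraycopy), while your array is simply overwritten.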
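The effect can be reproduced without JMH. The sketch below assumes 10 invocations sharing one array and one list (the count is made up, and Integer replaces Person); the class and method names are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

class GrowthAcrossInvocations {
    static final int LEN = 100_000;

    // Mirrors capacityTestArray: overwrites the same slots on every call.
    static void fillArray(Integer[] target) {
        for (int i = 0; i < LEN; i++) {
            target[i] = i;
        }
    }

    // Mirrors capacityTestArrayListEnsured: appends LEN more elements on every call.
    static void fillList(List<Integer> target) {
        for (int i = 0; i < LEN; i++) {
            target.add(i);
        }
    }

    public static void main(String[] args) {
        Integer[] array = new Integer[LEN];
        List<Integer> list = new ArrayList<>(LEN);
        int invocations = 10; // hypothetical number of invocations in one iteration
        for (int k = 0; k < invocations; k++) {
            fillArray(array);
            fillList(list);
        }
        System.out.println("array length: " + array.length);
        System.out.println("list size:    " + list.size());
    }
}
```

The array stays at LEN slots, while the list ends up holding invocations * LEN elements, far beyond the capacity it was presized for.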

Once you have figured this out, read the samples and you will see your second problem: your @Benchmark methods return void. And, contrary to the other answer, I would not suggest putting everything into the test method itself, but this raises the question of what you want to test to begin with. Do not forget that these are just numbers; in the end, you need to reason about what they mean and how to set up a JMH test properly.
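On the void point: the JMH samples discuss dead-code elimination, and returning the computed value from a benchmark method is one way to have it consumed. Below is a hedged sketch of what the method signatures could look like. The JMH annotations are left out so that it compiles with the JDK alone, the Params class here is a plain stand-in for the question's state class, and Integer replaces Person.

```java
import java.util.ArrayList;
import java.util.List;

class ReturningBenchmarks {

    // Plain stand-in for the question's @State class (annotations omitted).
    static class Params {
        int length = 100_000;
        Integer[] people = new Integer[length];
        List<Integer> peopleArrayListEnsure = new ArrayList<>(length);
    }

    // In the real benchmark this would keep the question's @Benchmark and related annotations.
    static Integer[] capacityTestArray(Params p) {
        for (int i = 0; i < p.length; i++) {
            p.people[i] = i;
        }
        return p.people; // returned instead of void, so the result is observable
    }

    // Same idea for the list variant; assumes the list is fresh or cleared before each call.
    static List<Integer> capacityTestArrayListEnsured(Params p) {
        for (int i = 0; i < p.length; i++) {
            p.peopleArrayListEnsure.add(i);
        }
        return p.peopleArrayListEnsure;
    }

    public static void main(String[] args) {
        Params p = new Params();
        System.out.println(capacityTestArray(p).length);
        System.out.println(capacityTestArrayListEnsured(p).size());
    }
}
```

Whether the state lives in a separate class or inside the method is a separate choice; the sketch only changes the return types.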

answered Oct 24 '22 10:10

Eugene

Even though I initially thought this was a natural performance difference, the comments below were right.

As commented below, the difference is indeed larger than expected.

The only scenario in which add() goes from O(1) to O(n) is when the list grows. Could it be that the tests are reusing the same ArrayList (as a result of setup not being called more than once)? This would only affect the ArrayList test, as the array test would just overwrite the values.
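To illustrate why growth is the expensive case, here is a toy growable array that counts how many elements its grow steps copy. It is not the real ArrayList: CountingList, its field names and its grow-by-roughly-50% rule are assumptions made for this sketch.

```java
import java.util.Arrays;

class CountingList {
    private Object[] elements;
    private int size;
    long copiedElements; // total number of elements moved by grow steps

    CountingList(int initialCapacity) {
        elements = new Object[Math.max(1, initialCapacity)];
    }

    void add(Object value) {
        if (size == elements.length) {
            // Full: allocate a larger array and copy everything over, an O(n) step.
            // The growth rule below is an assumption for illustration.
            int newCapacity = elements.length + (elements.length >> 1) + 1;
            elements = Arrays.copyOf(elements, newCapacity);
            copiedElements += size;
        }
        elements[size++] = value; // the usual O(1) path
    }

    int size() {
        return size;
    }

    public static void main(String[] args) {
        int len = 100_000;
        CountingList list = new CountingList(len);
        for (int i = 0; i < len; i++) {
            list.add(i);
        }
        System.out.println("copies after first fill:  " + list.copiedElements);
        for (int i = 0; i < len; i++) {
            list.add(i); // reusing the same list, as the benchmark does
        }
        System.out.println("copies after second fill: " + list.copiedElements);
    }
}
```

The first fill stays within the presized capacity and copies nothing; the second fill on the same list has to grow and pays for copying the existing elements.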

Just to be sure the ArrayList isn't growing:

public void capacityTestArrayListEnsured(Params p) 
{
    p.peopleArrayListEnsure = new ArrayList<>(p.length); //or clear()?
    for (int i = 0; i < p.length; i++) 
        p.peopleArrayListEnsure.add(new Person(i, new Address(i, i), new Pet(i, i)));
}

To make it fair, you could also allocate the array inside the other method, so that the allocation time is included in both measurements:

public void capacityTestArray(Params p)  
{
    p.people = new Person[p.length];
    for (int i = 0; i < p.length; i++) 
        p.people[i] = new Person(i, new Address(i, i), new Pet(i, i));
}

answered Oct 24 '22 11:10

aran
