The behaviour is (somewhat) documented in the javadoc: <blockquote> This implementation determines which is the smaller of this set and the specified collection, by invoking the size method on each. If this set has fewer elements, then the implementation iterates over this set, checking each element returned by the iterator in turn to see if it is contained in the specified collection. If it is so contained, it is removed from this set with the iterator's remove method. If the specified collection has fewer elements, then the implementation iterates over the specified collection, removing from this set each element returned by the iterator, using this set's remove method. </blockquote> What this means in practice, when you call <code>source.removeAll(removals);</code>: <ul> <li>if the <code>removals</code> collection is of a smaller size than <code>source</code>, the <code>remove</code> method of <code>HashSet</code> is called, which is fast.</li> <li>if the <code>removals</code> collection is of equal or larger size than the <code>source</code>, then <code>removals.contains</code> is called, which is slow for an ArrayList.</li> </ul> Quick fix: <pre class="prettyprint"><code>Collection<Integer> removals = new HashSet<Integer>(); </code></pre> Note that there is an open bug that is very similar to what you describe. The bottom line seems to be that it is probably a poor choice but can't be changed because it is documented in the javadoc. <hr> For reference, this is the code of <code>removeAll</code> (in Java 8 - haven't checked other versions): <pre class="prettyprint"><code>public boolean removeAll(Collection<?> c) { Objects.requireNonNull(c); boolean modified = false; if (size() > c.size()) { for (Iterator<?> i = c.iterator(); i.hasNext(); ) modified |= remove(i.next()); } else { for (Iterator<?> i = iterator(); i.hasNext(); ) { if (c.contains(i.next())) { i.remove(); modified = true; } } } return modified; } </code></pre>

The HashSet<T>.removeAll method is surprisingly slow

Tags:

java

performance

collections

hashset

The behaviour is (somewhat) documented in the javadoc:

This implementation determines which is the smaller of this set and the specified collection, by invoking the size method on each. If this set has fewer elements, then the implementation iterates over this set, checking each element returned by the iterator in turn to see if it is contained in the specified collection. If it is so contained, it is removed from this set with the iterator's remove method. If the specified collection has fewer elements, then the implementation iterates over the specified collection, removing from this set each element returned by the iterator, using this set's remove method.

What this means in practice, when you call source.removeAll(removals);:

if the removals collection is of a smaller size than source, the remove method of HashSet is called, which is fast.
if the removals collection is of equal or larger size than the source, then removals.contains is called, which is slow for an ArrayList.

Quick fix:

Collection<Integer> removals = new HashSet<Integer>();

Note that there is an open bug that is very similar to what you describe. The bottom line seems to be that it is probably a poor choice but can't be changed because it is documented in the javadoc.

For reference, this is the code of removeAll (in Java 8 - haven't checked other versions):

public boolean removeAll(Collection<?> c) {
    Objects.requireNonNull(c);
    boolean modified = false;

    if (size() > c.size()) {
        for (Iterator<?> i = c.iterator(); i.hasNext(); )
            modified |= remove(i.next());
    } else {
        for (Iterator<?> i = iterator(); i.hasNext(); ) {
            if (c.contains(i.next())) {
                i.remove();
                modified = true;
            }
        }
    }
    return modified;
}

Related questions
                            
                                How to get the number of columns from a JDBC ResultSet?
                            
                                How to set Spring profile from system variable?
                            
                                How to convert a Java object (bean) to key-value pairs (and vice versa)?
                            
                                Check instanceof in stream
                            
                                Regex for converting CamelCase to camel_case in java
                            
                                Java HTTP Client Request with defined timeout
                            
                                "Integer number too large" error message for 600851475143
                            
                                Handler is abstract ,cannot be instantiated
                            
                                Hidden features of Eclipse [closed]
                            
                                Reason for calling shutdown() on ExecutorService
                            
                                Android N Java 8 features (Jack compiler) and Kotlin interop
                            
                                Multiple Java versions running concurrently under Windows
                            
                                How is an overloaded method chosen when a parameter is the literal null value?
                            
                                Is there a practical use for weak references? [duplicate]
                            
                                Open Source Java Profilers [closed]
                            
                                How do I increase the number of displayed lines of a Java stack trace dump?
                            
                                Uninitialized Object vs Object Initialized to NULL
                            
                                Transitive dependencies not resolved for aar library using gradle
                            
                                Convert an array into an ArrayList [duplicate]
                            
                                Java 32-bit vs 64-bit compatibility

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With