In Java (1.5 or later), what is the best performing way to fetch an (any) element from a Set?

Tags: java, performance, iterator, set, toarray

In the code below, I needed to fetch an element, any element, from toSearch. I was unable to find a useful method on the Set interface definition to return just a single (random, but not required to be random) member of the set. So, I used the toArray()[0] technique (present in the code below).

private Set<Coordinate> floodFill(Value value, Coordinate coordinateStart)
{
    Set<Coordinate> result = new LinkedHashSet<Coordinate>();

    Set<Coordinate> toSearch = new LinkedHashSet<Coordinate>();
    toSearch.add(coordinateStart);
    while (toSearch.size() > 0)
    {
        Coordinate coordinate = (Coordinate)toSearch.toArray()[0];
        result.add(coordinate);
        toSearch.remove(coordinate);
        for (Coordinate coordinateAdjacent: getAdjacentCoordinates(coordinate))
        {
            if (this.query.getCoordinateValue(coordinateAdjacent) == value)
            {
                if (!result.contains(coordinateAdjacent))
                {
                    toSearch.add(coordinateAdjacent);
                }
            }
        }
    }

    return result;
}

The other technique I have seen discussed is to replace "(Coordinate)toSearch.toArray()[0]" with "toSearch.iterator().next()". Which technique, toArray() or iterator(), is the most likely to execute the most quickly with the least GC (Garbage Collection) impact?

My intuition (after composing this question) is that the second technique using the Iterator will be both faster in execution and lower overhead for the GC. Given I don't know the implementation of the Set being passed (assuming HashSet or LinkedHashSet as most likely), how much overhead is incurred in each of the toArray() or iterator() methods? Any insights on this would be greatly appreciated.

Questions (repeated from above):

1. Which technique, toArray() or iterator(), is the most likely to execute the most quickly with the least GC (Garbage Collection) impact?
2. Given I don't know the implementation of the Set being passed (assuming HashSet or LinkedHashSet as most likely), how much overhead is incurred in each of the toArray() and iterator() methods?

446

asked Dec 04 '10 23:12

chaotic3quilibrium

2 Answers

toSearch.iterator().next() will be faster and less memory-intensive because it does not copy any data: at most it allocates one small iterator object. toArray, by contrast, allocates a new array and copies every element of the set into it, so its cost grows with the size of the set. This holds regardless of the Set implementation: toArray always has to copy the data.
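As a rough illustration, the iterator approach can be wrapped in two small helpers. This is only a sketch: the class name AnyElementSketch and the methods peekAny and pollAny are made up for this example, and pollAny assumes the set's iterator supports remove(), which is true for HashSet and LinkedHashSet.

```java
import java.util.Iterator;
import java.util.LinkedHashSet;
import java.util.Set;

// Hypothetical helper class, not part of the JDK.
final class AnyElementSketch {

    // Returns some element of the set without copying its contents.
    // Throws NoSuchElementException if the set is empty.
    static <T> T peekAny(Set<T> set) {
        return set.iterator().next();
    }

    // Removes and returns some element through a single iterator, which
    // avoids the separate remove(Object) lookup used in the question.
    // Assumes the iterator supports remove().
    static <T> T pollAny(Set<T> set) {
        Iterator<T> it = set.iterator();
        T element = it.next();
        it.remove();
        return element;
    }

    public static void main(String[] args) {
        Set<String> toSearch = new LinkedHashSet<String>();
        toSearch.add("a");
        toSearch.add("b");
        System.out.println(peekAny(toSearch)); // a (LinkedHashSet iterates in insertion order)
        System.out.println(pollAny(toSearch)); // a
        System.out.println(toSearch);          // [b]
    }
}
```

In the loop from the question, pollAny(toSearch) would stand in for both the toArray()[0] line and the toSearch.remove(coordinate) call.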

102

answered Sep 20 '22 01:09

Cameron Skinner

From what I can see, you are doing a breadth-first search.

Below is an example of how it could be implemented without using toArray:

private Set<Coordinate> floodFill(Value value, Coordinate coordinateStart) {
    // Insertion-ordered set of coordinates already processed; this is also the result.
    final Set<Coordinate> visitedCoordinates = new LinkedHashSet<Coordinate>();
    // FIFO queue of coordinates still to be processed.
    final Deque<Coordinate> deque = new ArrayDeque<Coordinate>();

    deque.add(coordinateStart);

    while (!deque.isEmpty()) {
        // poll() takes from the head and add() appends to the tail.
        final Coordinate currentVertex = deque.poll();
        visitedCoordinates.add(currentVertex);
        for (Coordinate coordinateAdjacent : getAdjacentCoordinates(currentVertex)) {
            if (this.query.getCoordinateValue(coordinateAdjacent) == value) {
                if (!visitedCoordinates.contains(coordinateAdjacent)) {
                    deque.add(coordinateAdjacent);
                }
            }
        }
    }

    return visitedCoordinates;
}

Implementation notes:

"And now I am concerned that the contains() method's implementation on LinkedList could be doing up to a full scan of the contents before returning the answer."

You are right about the full scan (a linear search). In your case, though, you can keep an additional set that tracks the vertices already visited (which is in fact your result set). Checking that set with contains() takes O(1) time on average, which solves the problem.
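A minimal, self-contained sketch of that idea follows. It assumes a hypothetical adjacency map of integers in place of Coordinate and getAdjacentCoordinates, and the names VisitedSetSketch and reachable are invented for the example. It is also a slight variant: it marks a vertex as visited when it is enqueued rather than when it is polled, so no vertex enters the queue twice.

```java
import java.util.ArrayDeque;
import java.util.Arrays;
import java.util.Deque;
import java.util.HashMap;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical example class; the integer graph stands in for the coordinate grid.
final class VisitedSetSketch {

    // Breadth-first traversal from start. Set.add returns false when the
    // element is already present, so it doubles as the hash-based
    // "already visited?" check and no list ever needs to be scanned.
    static Set<Integer> reachable(Map<Integer, List<Integer>> adjacency, int start) {
        Set<Integer> visited = new LinkedHashSet<Integer>();
        Deque<Integer> queue = new ArrayDeque<Integer>();
        visited.add(start);
        queue.add(start);
        while (!queue.isEmpty()) {
            Integer current = queue.poll();
            List<Integer> neighbours = adjacency.get(current);
            if (neighbours == null) {
                continue; // vertex with no outgoing edges recorded
            }
            for (Integer next : neighbours) {
                if (visited.add(next)) { // true only on first sighting
                    queue.add(next);
                }
            }
        }
        return visited;
    }

    public static void main(String[] args) {
        Map<Integer, List<Integer>> adjacency = new HashMap<Integer, List<Integer>>();
        adjacency.put(1, Arrays.asList(2, 3));
        adjacency.put(2, Arrays.asList(1, 4));
        adjacency.put(3, Arrays.asList(1, 4));
        adjacency.put(4, Arrays.asList(2, 3));
        System.out.println(reachable(adjacency, 1)); // prints [1, 2, 3, 4]
    }
}
```

The queue only decides the processing order; all membership questions go to the set, so it does not matter that ArrayDeque or LinkedList would need a linear scan for contains().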

Cheers

answered Sep 20 '22 01:09

Petro Semeniuk
