
List implementations: does LinkedList really perform so poorly vs. ArrayList and TreeList?

Tags: java, collections, arraylist, linked-list, treelist

Taken from the Apache TreeList doc:

The following relative performance statistics are indicative of this class:

             get  add  insert  iterate  remove
 TreeList       3    5       1       2       1
 ArrayList      1    1      40       1      40
 LinkedList  5800    1     350       2     325

It goes on to say:

LinkedList is rarely a good choice of implementation. TreeList is almost always a good replacement for it, although it does use slightly more memory.

My questions are:

What is with the ArrayList add, insert, and remove times crushing LinkedList? Should we expect, for one, that real-world insertion and removal cases greatly favor ArrayList?
Does this TreeList simply put the nail in the coffin of the venerable LinkedList?

I am tempted to conclude they have amortized or ignored ArrayList's growing pains, and have not taken into consideration the insertion and removal times for an item in a LinkedList that has already been located.

631

asked Nov 11 '09 05:11

wsorenson

6 Answers

The key thing here is the complexity of insert/delete operations in the three List implementations. ArrayList has O(n) insert/delete times for arbitrary indices, but O(1) (amortized, for inserts) when the operation is at the end of the list. ArrayList also has the convenience of O(1) access to any position. LinkedList is similarly O(n) for insert/delete at arbitrary indices, but O(1) for operations at either end of the List (start and end), and access to arbitrary positions is O(n). TreeList has O(log n) complexity for all operations at any position.
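To make these complexities concrete, here is a rough cost model, not a benchmark. The class and method names are made up for illustration, and the counts are approximations: element shifts for an array, node hops for a doubly linked list that walks from the nearer end (as java.util.LinkedList does), and about log2(size) steps for a balanced tree.

```java
// Rough cost model only: counts element moves or node hops, not real time.
// PositionalCost and its methods are hypothetical names used for illustration.
class PositionalCost {
    // ArrayList.add(index, e) shifts every element at or after index one slot right.
    static int arrayListShifts(int size, int index) {
        return size - index;
    }

    // java.util.LinkedList walks from whichever end is closer to the index.
    static int linkedListHops(int size, int index) {
        return Math.min(index, size - index);
    }

    // A balanced tree needs on the order of log2(size) steps at any position.
    static int treeSteps(int size) {
        return 32 - Integer.numberOfLeadingZeros(Math.max(size, 1));
    }

    public static void main(String[] args) {
        int size = 1_000_000;
        for (int index : new int[] {0, size / 2, size}) {
            System.out.printf("index=%d array=%d linked=%d tree~%d%n",
                    index, arrayListShifts(size, index),
                    linkedListHops(size, index), treeSteps(size));
        }
    }
}
```

In this model the array is worst at the front, the linked list is worst in the middle, and the tree costs about the same everywhere.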

This shows that TreeList is faster for large enough Lists as far as inserts/deletes at arbitrary positions are concerned. But AFAIK, TreeList is implemented as a balanced binary tree and has a much bigger constant associated with its O(log n) operations than the equivalent operations on an ArrayList, which is simply a wrapper around an array. This makes TreeList actually slower for small Lists. Also, if all you are doing is appending elements to a List, the O(1) performance of ArrayList/LinkedList is clearly faster. Moreover, the number of inserts/deletes is often much smaller than the number of accesses, which tends to make ArrayList faster overall in many cases. LinkedList's constant-time insert/delete at either end of the List makes it well suited to implementing data structures like Queues, Stacks and Deques.

At the end of the day, it all depends on what exactly you need a List for. There isn't a one-size-fits-all solution. You have to select the implementation most suitable for your job.

116

answered Oct 21 '22 05:10

MAK

It's due to the data structures behind these Collections. TreeList is a tree, which allows for relatively fast reads, insertions and removals (all O(log n)). ArrayList uses an array to store the data, so when you insert or remove, every item after that position has to be shifted up or down (O(n) worst case). Arrays also have a fixed size, so if an insertion would exceed the current array's capacity, a new, larger one must be created and the contents copied over (growing by a constant factor, such as 1.5x or 2x, keeps resizes to a minimum). LinkedList uses... a linked list. A linked list usually has a reference to the first (and sometimes last) element in the list. Each element in the list then has a reference to either the next element (for a singly linked list) or the next and previous elements (for a doubly linked list). Because of this, to access a specific element, you must iterate through every element before it to get there (O(n) worst case). When inserting or removing specific elements, you must first find the position to insert or remove them at, which takes time (O(n) worst case). However, there is very little cost to simply adding another element to the beginning or end (O(1)).
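The resizing described above can be sketched with a tiny growable array. This is a hypothetical illustration, not the real ArrayList source: it assumes a doubling growth policy and a starting capacity of 1, and it counts how many elements get copied during resizes.

```java
import java.util.Arrays;

// Hypothetical minimal growable int array; not the real ArrayList source.
class GrowableIntArray {
    private int[] data = new int[1];
    private int size = 0;
    long copiedElements = 0; // total elements moved by all resizes so far

    void add(int value) {
        if (size == data.length) {
            // Full: allocate an array twice as large and copy everything over.
            data = Arrays.copyOf(data, data.length * 2);
            copiedElements += size;
        }
        data[size++] = value;
    }

    int size() { return size; }
    int capacity() { return data.length; }

    public static void main(String[] args) {
        GrowableIntArray a = new GrowableIntArray();
        int n = 1024;
        for (int i = 0; i < n; i++) a.add(i);
        // With doubling, total copies stay below 2 * n, so each add is O(1) amortized.
        System.out.println("appends=" + n + " copies=" + a.copiedElements);
    }
}
```

Because the total number of copies stays proportional to the number of appends, the occasional expensive resize averages out to a constant cost per add.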

There are whole books written on data structures and when to use them. I recommend reading up on the more fundamental ones.

answered Oct 21 '22 05:10

David Brown

Because a linked list has to navigate node by node to get anywhere in the list (save the front and probably the back depending on implementation) it makes sense that the numbers are so high.

For add/insert/remove in a large LinkedList you would have a lot of hopping from node to node to get to the correct spot.

If the ArrayList is created with the proper size to start with, the growing pains will be nothing. If the ArrayList is small, the growing pains don't matter.

For the LinkedList, if the operations are all near the front of the list, the impact is far smaller than if they are near the end.

What you should do is always use the interface (e.g. List) when declaring variables and parameters. Then you can change "new LinkedList();" to "new ArrayList();" and profile to see how each performs in your specific code.
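A minimal sketch of that advice follows. The class and method names are hypothetical, and the timing is a crude wall-clock measurement meant only to show the swap; a proper harness such as JMH is a better basis for real decisions.

```java
import java.util.ArrayList;
import java.util.LinkedList;
import java.util.List;
import java.util.function.Supplier;

// Hypothetical example: the workload depends only on the List interface.
class ListSwap {
    // Inserts n values at the head, then sums them; works with any List implementation.
    static long insertAtFrontAndSum(List<Integer> list, int n) {
        for (int i = 0; i < n; i++) {
            list.add(0, i); // positional insert at the head
        }
        long sum = 0;
        for (int value : list) sum += value;
        return sum;
    }

    // Crude wall-clock timing of the workload for whichever implementation the factory creates.
    static long timeNanos(Supplier<List<Integer>> factory, int n) {
        long start = System.nanoTime();
        insertAtFrontAndSum(factory.get(), n);
        return System.nanoTime() - start;
    }

    public static void main(String[] args) {
        int n = 50_000;
        System.out.println("ArrayList  ns: " + timeNanos(ArrayList::new, n));
        System.out.println("LinkedList ns: " + timeNanos(LinkedList::new, n));
    }
}
```

Only the factory argument changes between the two runs; the workload itself never names a concrete class.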

Because of the speed of not having to hop from node to node I always default to ArrayList instead of LinkedList.

I would believe the tree list is going to be significantly faster than both (even without looking at the code). Trees are designed to be fast.

answered Oct 21 '22 04:10

TofuBeer

Everyone who answered here is correct. They are all right that it depends very heavily on your usage pattern, i.e. there is no one-size-fits-all List. But at the time of writing they have all forgotten to mention (either that, or I am a sloppy reader) the use case where LinkedList is at its best: iterator-positioned insert. That is, if you're doing not just

LinkedList::add(int index, E element) 
          Inserts the specified element at the specified position in this list.

which seems to be the method they used to obtain the statistics, but

ListIterator.add(E element)

with an iterator obtained through either

public abstract ListIterator<E> listIterator(int index)
Returns a list-iterator of the elements in this list (in proper sequence), starting at the specified position in the list.

public Iterator<E> iterator()
Returns an iterator over the elements in this list (in proper sequence).

then you're bound to get the best arbitrary-position insertion performance of any of these lists. (Only a ListIterator can add elements; a plain Iterator can only remove them.) That implies, of course, that you're able to limit the number of calls to iterator() and listIterator(), and the number of the iterator's movements through the list (e.g. you can do a single sequential pass over the list to perform all the inserts you need). This limits the number of use cases, but they are nevertheless ones that occur very often. And LinkedList's performance in them is the reason why it is kept (and will continue to be kept) in the container libraries of so many languages, not just Java.
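As an illustration of the single-pass pattern, here is a small sketch using java.util.ListIterator. The class name, the method name and the "insert a marker before each negative value" task are invented for the example.

```java
import java.util.Arrays;
import java.util.LinkedList;
import java.util.List;
import java.util.ListIterator;

// Hypothetical example of iterator-positioned insertion in one sequential pass.
class IteratorInsert {
    // Inserts `marker` directly before every negative value.
    static void markNegatives(List<Integer> list, int marker) {
        ListIterator<Integer> it = list.listIterator();
        while (it.hasNext()) {
            int value = it.next();
            if (value < 0) {
                it.previous();   // step back so the cursor sits before `value`
                it.add(marker);  // O(1) relink on a LinkedList; cursor is now after marker
                it.next();       // move past `value` again
            }
        }
    }

    public static void main(String[] args) {
        List<Integer> list = new LinkedList<>(Arrays.asList(1, -2, 3, -4));
        markNegatives(list, 0);
        System.out.println(list); // [1, 0, -2, 3, 0, -4]
    }
}
```

The same method also accepts an ArrayList, but there each add has to shift the tail of the array, so the pass as a whole is no longer linear.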

PS. All of the above of course applies to other operations as well, like get(), remove(), etc. That is, carefully designed access through an iterator will make all of them O(1) with a very small actual constant. The same can of course be said for the other Lists, i.e. iterator access will speed them all up (however slightly). But not ArrayList's insert and remove: those are still going to be O(n). And not TreeList's insert and remove: tree balancing overhead is not something you can avoid. And TreeList probably has more memory overhead. You get the idea. To sum it all up, LinkedList is for small, high-performance, scan-like operations over lists. Whether that is the use case you need, only you can tell.

PPS. That said, I therefore also remain

tempted to conclude they have amortized or ignored ArrayList's growing pains, and have not taken into consideration the insertion and removal times for an item in a LinkedList that has already been located.

answered Oct 21 '22 04:10

nightingale

For the ArrayList, since resizing happens infrequently, you can basically treat that cost as negligible. If it is actually a problem, then just make the array larger to start with.

If I have a small list then a LinkedList is fine to use, as the other implementations offer minimal benefit at that size. If the list is going to be long then obviously a TreeList makes more sense.

If I am going to be doing a great deal of random access to a list then the ArrayList makes more sense.

Which container to use really depends on what you will be doing with it. There is no one correct container, as each has their strengths and weaknesses, and with experience you start to get an understanding of when to use which one.
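The first suggestion above, making the array large enough to start with, might look like the following sketch. The class and method names are hypothetical; the ArrayList(int initialCapacity) constructor is the standard one, and ArrayList.ensureCapacity(int) offers the same hint for a list that already exists.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical example of presizing an ArrayList when the final size is known.
class Presized {
    static List<Integer> squares(int expectedSize) {
        // Capacity hint: the backing array is allocated once, so the loop below never resizes.
        List<Integer> result = new ArrayList<>(expectedSize);
        for (int i = 0; i < expectedSize; i++) {
            result.add(i * i);
        }
        return result;
    }

    public static void main(String[] args) {
        System.out.println(squares(5)); // [0, 1, 4, 9, 16]
    }
}
```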

answered Oct 21 '22 05:10

James Black

Note that ArrayList is generally faster than LinkedList, even when your code calls just the methods that are constant time for both. For example, ArrayList.add() simply stores a single reference and increments a counter when no resizing is needed, while LinkedList.add() must also create a node and set multiple pointers. In addition, the LinkedList nodes require more memory, which slows down your application, and garbage collection must deal with the nodes.

If you need to add or remove elements from either end of the list, but don't require random access, an ArrayDeque is faster than a LinkedList, though it requires Java 6 or later.
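For reference, a short sketch of ArrayDeque used at both ends. The class name and the values are invented for illustration; the Deque methods shown are the standard java.util ones.

```java
import java.util.ArrayDeque;
import java.util.Deque;

// Hypothetical example: ArrayDeque as a double-ended queue.
class DequeDemo {
    static String demo() {
        Deque<String> deque = new ArrayDeque<>();
        deque.addLast("b");   // queue-style append at the tail
        deque.addLast("c");
        deque.addFirst("a");  // stack-style push at the head
        String first = deque.pollFirst(); // removes "a"
        String last = deque.pollLast();   // removes "c"
        return first + last + deque.size(); // one element ("b") remains
    }

    public static void main(String[] args) {
        System.out.println(demo()); // ac1
    }
}
```

Code written against the Deque interface can switch between ArrayDeque and LinkedList without other changes, since both implement it.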

A LinkedList makes sense for iterating across the list and then adding or removing elements in the middle, but that's an unusual situation.

answered Oct 21 '22 04:10

Jared Levy
