Does anyone know a good resource to concisely explain the different types of lists available in C# and when their usage is appropriate? For example, List, Hashtable, Dictionaries etc. I'm never quite sure when I should be using what.

These aren't all lists, although they're all collections. Here's a quick summary. Non-generic collections (API is in terms of <code>object</code>. Values types are boxed. These are mostly in the System.Collections namespace: <ul> <li> ArrayList: A list of items, backed by an array. Fast random read/write access. Fast add to the tail end, if the buffer doesn't need resizing.</li> <li> Hashtable: Map from key to value. Keys are unique, values don't have to be. Uses the GetHashCode method to achieve near O(1) read/write access (aside from nasty cases where all items have the same hash, or the backing store needs rebuilding). Iterating over the key/value pairs gives an unpredictable order. (Well, effectively unpredictable.)</li> <li> SortedList: Like a Hashtable, but the entries are always returned in sorted-by-key order. Stored as a list of key/value pairs.</li> <li> Stack: Last-in-first-out collection</li> <li> Queue: First-in-first-out collection</li> <li> Array: Fixed-size O(1) random-access; non-generic, but has strongly typed forms as well</li> </ul> Generic collections. (Strongly-typed API, will not box value types (assuming suitable T). These are mostly in the System.Collections.Generic namespace: <ul> <li> List<T>: Like ArrayList</li> <li> Dictionary<TKey, TValue>: like Hashtable</li> <li> SortedList<TKey, TValue>: like SortedList</li> <li> SortedDictionary<TKey, TValue>: like SortedList, but stored as a tree of key/value pairs which gives better performance in many situations. See docs for more detail.</li> <li> LinkedList<T>: Doubly linked list (fast access to head and tail)</li> <li> Stack<T>: Like Stack</li> <li> Queue<T>: Like Queue</li> <li> ReadOnlyCollection<T>: Like List<T> but giving a read-only view</li> </ul> Possibly the most important collection interface is IEnumerable (and IEnumerable<T>). This represents a sequence of items much like a Stream represents a sequence of bytes. There is no random access, just forward-reading. LINQ to Objects is based on this, and pretty much all collection types implement it.

To expound on tobsen's earlier answer, the C5 Generic Collection Library has a large number of, well, collections. I'll describe some of them here: Queue/Stack <ul> <li> <code>CircularQueue<T></code>: This class provides strictly Queue and Stack functionality. As well, efficient O(1) access to any item in the Stack/Queue is available using the indexer: <code>cq[0]</code> (where 0 is the oldest item, next to be dequeued, last to be popped).</li> </ul> Lists Note: <code>ArrayList</code> and <code>LinkedList</code> can also function as Queue/Stacks <ul> <li> <code>ArrayList<T></code>: Similar to its counterpart in <code>System.Collections.Generic (SCG)</code>, <code>List<T></code>, this is backed by an array, guaranteeing O(1) indexing, but worst-case O(n) insertion.O(n) to find an item.</li> <li> <code>LinkedList<T></code>: Similar to its counterpart <code>SCG.LinkedList<T></code>. Using a doubly-linked list, guarantees O(1) insertion, but worst-case O(n) indexing (in practice, is proportional to distance from either head or tail of the list). Also O(n) to find an item. Sorting uses a stable Merge Sort.</li> <li> <code>HashedArrayList<T></code>: Similar to the <code>ArrayList<T></code> above, but does not allow duplicates. The benefit you get in return is that the time to find an item and its index is reduced to O(1).</li> <li> <code>HashedLinkedList<T></code>: Similar to the <code>LinkedList<T></code> above, but does not allow duplicates. As before, the time to find an item is reduced to O(1), but time to find its index remains O(n).</li> <li> <code>WrappedArray<T></code>: Fairly similar to the <code>ArrayList<T></code>, this acts as a wrapper around an array that implements <code>C5.IList<T></code>, but throws exceptions if an attempt is made to modify the collection (<code>IsFixedSize</code> is true and <code>Add</code>, <code>Remove</code>, <code>Insert</code> don't work; <code>Sort</code>, <code>Shuffle</code>, and <code>Reverse</code> do, however, as they are in-place operations).</li> </ul> Lists also provide a "View" functionality which represents a segment of the underlying list, allowing local operations to be performed. Using patterns offered in the C5 book, operations can be performed using views that are efficient on both array and linked lists. Any list operation can also be performed on a view, restricting their effect to a subset of the underlying list. Sorted Collections <ul> <li> <code>SortedArray<T></code>: Similar to an <code>ArrayList<T></code> except that it keeps its items sorted and does not allow duplicates. Note that random insertions and deletions on this collection are slow. This collection is best if the number of items is small or rarely modified but often accessed by item index or value.</li> <li> <code>TreeSet<T></code>: Uses a red-black tree structure to keep items sorted. As a set, it does not allow duplicates. Access by index or item value and insertion/deletion take O(log n).</li> <li> <code>TreeBag<T></code>: Uses a red-black tree, keeping items sorted. As a bag, it does allow duplicates, but does not store duplicates in the tree, rather keeping duplicates by counting.</li> </ul> Both <code>TreeSet<T></code> and <code>TreeBag<T></code> provide the ability to efficiently make "snapshots" or persistent copies of the tree in O(1), allowing iteration over the snapshot while modifying the underlying tree. Note that each snapshot on a tree adds a performance penalty to updates to the tree, but these effects go away when the snapshot is disposed. Hash Collections <ul> <li> <code>HashSet<T></code>: A collection using a simple hash table for storage. Access by item value takes O(1). As a set, it does not allow duplicates. Provides a function <code>BucketCostDistribution()</code> that can help tell you determine the efficiency of the items' hashcode function.</li> <li> <code>HashBag<T></code>: Similar to the <code>HashSet<T></code>, but has bag semantics, meaning that duplicates are allowed, but duplicates are only stored by counting.</li> </ul> Priority Queue <ul> <li> <code>IntervalHeap<T></code>: Provides a priority queue. Finding the maximum and minimum are O(1) operations, deleting the maximum, minimum, adding, and updating are O(log n) operations. Allows duplicates by storing them explicitly (not by counting).</li> </ul> Dictionaries <ul> <li> <code>HashDictionary<H,K></code>: Similar to the <code>SCG.Dictionary<H,K></code>, provides entry access, insertion, and deletion in O(1). Also provides a <code>BucketCostDistribution()</code> function as <code>HashSet<T></code> above. Does not guarantee any particular enumeration order.</li> <li> <code>TreeDictionary<H,K></code>: Similar to the <code>SCG.SortedDictionary<H,K></code>, provides a persistently sorted dictionary using a red-black tree. Entry access, insertion, and deletion take O(log n). Guarantees that enumeration of the dictionary follows the order specified by the key comparer.</li> </ul> Guarded Collections As well, C5 also offers "guarded" collections, which effectively acts as a read-only wrapper, preventing the collection from being modified. Items in the collection still may be modified, but items can't be added, deleted, or inserted into the collection. A long answer, but thorough on the C5 libraries various collections at your disposal. I have found the C5 library to be great and often use it in my own code, replacing the common C# header with: <pre class="prettyprint"><code>using C5; using SCG = System.Collections.Generic; </code></pre>

Where can I learn about the various types of .NET lists?

2 Answers

These aren't all lists, although they're all collections. Here's a quick summary.

Non-generic collections (API is in terms of object. Values types are boxed.

These are mostly in the System.Collections namespace:

ArrayList: A list of items, backed by an array. Fast random read/write access. Fast add to the tail end, if the buffer doesn't need resizing.
Hashtable: Map from key to value. Keys are unique, values don't have to be. Uses the GetHashCode method to achieve near O(1) read/write access (aside from nasty cases where all items have the same hash, or the backing store needs rebuilding). Iterating over the key/value pairs gives an unpredictable order. (Well, effectively unpredictable.)
SortedList: Like a Hashtable, but the entries are always returned in sorted-by-key order. Stored as a list of key/value pairs.
Stack: Last-in-first-out collection
Queue: First-in-first-out collection
Array: Fixed-size O(1) random-access; non-generic, but has strongly typed forms as well

Generic collections. (Strongly-typed API, will not box value types (assuming suitable T).

These are mostly in the System.Collections.Generic namespace:

List<T>: Like ArrayList
Dictionary<TKey, TValue>: like Hashtable
SortedList<TKey, TValue>: like SortedList
SortedDictionary<TKey, TValue>: like SortedList, but stored as a tree of key/value pairs which gives better performance in many situations. See docs for more detail.
LinkedList<T>: Doubly linked list (fast access to head and tail)
Stack<T>: Like Stack
Queue<T>: Like Queue
ReadOnlyCollection<T>: Like List<T> but giving a read-only view

Possibly the most important collection interface is IEnumerable (and IEnumerable<T>). This represents a sequence of items much like a Stream represents a sequence of bytes. There is no random access, just forward-reading. LINQ to Objects is based on this, and pretty much all collection types implement it.

124

answered Sep 16 '22 22:09

Jon Skeet

To expound on tobsen's earlier answer, the C5 Generic Collection Library has a large number of, well, collections. I'll describe some of them here:

Queue/Stack

CircularQueue<T>: This class provides strictly Queue and Stack functionality. As well, efficient O(1) access to any item in the Stack/Queue is available using the indexer: cq[0] (where 0 is the oldest item, next to be dequeued, last to be popped).

Lists

Note: ArrayList and LinkedList can also function as Queue/Stacks

ArrayList<T>: Similar to its counterpart in System.Collections.Generic (SCG), List<T>, this is backed by an array, guaranteeing O(1) indexing, but worst-case O(n) insertion.O(n) to find an item.
LinkedList<T>: Similar to its counterpart SCG.LinkedList<T>. Using a doubly-linked list, guarantees O(1) insertion, but worst-case O(n) indexing (in practice, is proportional to distance from either head or tail of the list). Also O(n) to find an item. Sorting uses a stable Merge Sort.
HashedArrayList<T>: Similar to the ArrayList<T> above, but does not allow duplicates. The benefit you get in return is that the time to find an item and its index is reduced to O(1).
HashedLinkedList<T>: Similar to the LinkedList<T> above, but does not allow duplicates. As before, the time to find an item is reduced to O(1), but time to find its index remains O(n).
WrappedArray<T>: Fairly similar to the ArrayList<T>, this acts as a wrapper around an array that implements C5.IList<T>, but throws exceptions if an attempt is made to modify the collection (IsFixedSize is true and Add, Remove, Insert don't work; Sort, Shuffle, and Reverse do, however, as they are in-place operations).

Lists also provide a "View" functionality which represents a segment of the underlying list, allowing local operations to be performed. Using patterns offered in the C5 book, operations can be performed using views that are efficient on both array and linked lists. Any list operation can also be performed on a view, restricting their effect to a subset of the underlying list.

Sorted Collections

SortedArray<T>: Similar to an ArrayList<T> except that it keeps its items sorted and does not allow duplicates. Note that random insertions and deletions on this collection are slow. This collection is best if the number of items is small or rarely modified but often accessed by item index or value.
TreeSet<T>: Uses a red-black tree structure to keep items sorted. As a set, it does not allow duplicates. Access by index or item value and insertion/deletion take O(log n).
TreeBag<T>: Uses a red-black tree, keeping items sorted. As a bag, it does allow duplicates, but does not store duplicates in the tree, rather keeping duplicates by counting.

Both TreeSet<T> and TreeBag<T> provide the ability to efficiently make "snapshots" or persistent copies of the tree in O(1), allowing iteration over the snapshot while modifying the underlying tree. Note that each snapshot on a tree adds a performance penalty to updates to the tree, but these effects go away when the snapshot is disposed.

Hash Collections

HashSet<T>: A collection using a simple hash table for storage. Access by item value takes O(1). As a set, it does not allow duplicates. Provides a function BucketCostDistribution() that can help tell you determine the efficiency of the items' hashcode function.
HashBag<T>: Similar to the HashSet<T>, but has bag semantics, meaning that duplicates are allowed, but duplicates are only stored by counting.

Priority Queue

IntervalHeap<T>: Provides a priority queue. Finding the maximum and minimum are O(1) operations, deleting the maximum, minimum, adding, and updating are O(log n) operations. Allows duplicates by storing them explicitly (not by counting).

Dictionaries

HashDictionary<H,K>: Similar to the SCG.Dictionary<H,K>, provides entry access, insertion, and deletion in O(1). Also provides a BucketCostDistribution() function as HashSet<T> above. Does not guarantee any particular enumeration order.
TreeDictionary<H,K>: Similar to the SCG.SortedDictionary<H,K>, provides a persistently sorted dictionary using a red-black tree. Entry access, insertion, and deletion take O(log n). Guarantees that enumeration of the dictionary follows the order specified by the key comparer.

Guarded Collections

As well, C5 also offers "guarded" collections, which effectively acts as a read-only wrapper, preventing the collection from being modified. Items in the collection still may be modified, but items can't be added, deleted, or inserted into the collection.

A long answer, but thorough on the C5 libraries various collections at your disposal. I have found the C5 library to be great and often use it in my own code, replacing the common C# header with:

using C5; using SCG = System.Collections.Generic;

answered Sep 17 '22 22:09

Marcus Griep

Related questions
                            
                                Where is the Stylecop configuration file?
                            
                                How to create .Net 5.0 Class Library project in Visual Studio 2019 16.8.1?
                            
                                Where is my System.Numerics namespace?
                            
                                Linq To Sql with PostgreSQL
                            
                                Get filtered items from a CollectionView
                            
                                ASP.NET MVC 4 jQuery Validation Script Bundle Not Working
                            
                                Static Method implementation in VB.NET
                            
                                Absolute URL from base + relative URL in C#
                            
                                Join and Include in Entity Framework
                            
                                ref and out in C++/CLI
                            
                                Best way to disable the column header sorting in DataGridView [duplicate]
                            
                                Global exception handler for windows services?
                            
                                Nested/Child TransactionScope Rollback
                            
                                Implement Lucene on Existing .NET / SQL Server stack with multiple webservers
                            
                                Why does "\n" give a new line on Windows?
                            
                                How to change Panel Border Color
                            
                                System.ComponentModel.Win32Exception: The operation completed successfully
                            
                                Event handlers not thread safe? [duplicate]
                            
                                Is there an API for Cruise Control .NET? [closed]
                            
                                SignalR 2.0.2 and Owin 2.0.0 dependency conflict

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Where can I learn about the various types of .NET lists?

Tags:

.net

collections

Simon Keep

People also ask

2 Answers

Jon Skeet

Marcus Griep

Recent Activity

Donate For Us