I have a big list of strings (about 5k-20k entries) that I need to sort and remove duplicates from.
I've done this in two ways now, once with a HashSet and once purely with LINQ. Tests at that entry count did not show a big difference, but I'm wondering which approach, and thus which method, is better suited.
For the two approaches (myList is of type List&lt;string&gt;):
LINQ: I'm using one LINQ statement to order the list and get the distinct values from it.
myList = myList.OrderBy(q => q).Distinct().ToList();
HashSet: I'm using a HashSet to remove all duplicates and then I'm ordering the list.
myList = new HashSet<string>(myList).ToList();
myList = myList.OrderBy(q => q).ToList();
Like I said, the tests I made showed about the same time consumption for both methods, but I'm still wondering if one method is better than the other and, if so, why (the code is for a high-performance part and I need to get every millisecond out of it that I can).
C#'s LINQ Distinct() method removes the duplicate elements from a sequence and returns the distinct elements from a single data source. It falls under the set operators category of the LINQ query operators, and it works much like the DISTINCT keyword in SQL.
Note: a HashSet<T> is a collection of distinct values.
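As a minimal, self-contained sketch of both approaches (the sample strings are made up):

using System;
using System.Collections.Generic;
using System.Linq;

class DistinctDemo
{
    static void Main()
    {
        // Hypothetical input with duplicates.
        var words = new List<string> { "pear", "apple", "pear", "banana", "apple" };

        // LINQ: Distinct() drops duplicates, then OrderBy() sorts the survivors.
        var viaLinq = words.Distinct().OrderBy(q => q).ToList();

        // HashSet<T>: the constructor drops duplicates; sort afterwards.
        var viaHashSet = new HashSet<string>(words).OrderBy(q => q).ToList();

        Console.WriteLine(string.Join(", ", viaLinq));    // apple, banana, pear
        Console.WriteLine(string.Join(", ", viaHashSet)); // apple, banana, pear
    }
}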
If you're really concerned about every nanosecond, then
myList = myList.Distinct().OrderBy(q => q).ToList();
might be slightly faster than:
myList = myList.OrderBy(q => q).Distinct().ToList();
if there are a large number of duplicates, since the sort then only has to handle the distinct values rather than the whole list.
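If you want to verify this on your own data, a rough Stopwatch micro-benchmark is easy to put together (the workload below is made up, and for serious measurements something like BenchmarkDotNet would be more reliable):

using System;
using System.Diagnostics;
using System.Linq;

class DedupBenchmark
{
    static void Main()
    {
        // Synthetic workload in the question's size range, with heavy duplication (assumed).
        var rng = new Random(42);
        var data = Enumerable.Range(0, 20000)
                             .Select(_ => "item" + rng.Next(0, 500))
                             .ToList();

        var sw = Stopwatch.StartNew();
        var dedupeFirst = data.Distinct().OrderBy(q => q).ToList(); // sorts only the ~500 distinct values
        sw.Stop();
        Console.WriteLine($"Distinct then OrderBy: {sw.Elapsed.TotalMilliseconds:F3} ms");

        sw.Restart();
        var sortFirst = data.OrderBy(q => q).Distinct().ToList();   // sorts all 20,000 entries, then dedupes
        sw.Stop();
        Console.WriteLine($"OrderBy then Distinct: {sw.Elapsed.TotalMilliseconds:F3} ms");
    }
}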
The LINQ method is more readable and will have similar performance to explicitly creating a HashSet<T>, as others have said. In fact, it may be slightly faster if the original list is already sorted, since the LINQ method preserves the initial order before sorting, while explicitly creating a HashSet<T> will enumerate in an undefined order.
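A quick sketch of that ordering difference (toy data; note that HashSet<T>'s enumeration order is formally undefined even when it happens to look insertion-ordered):

using System;
using System.Collections.Generic;
using System.Linq;

class OrderDemo
{
    static void Main()
    {
        var sorted = new List<string> { "a", "b", "b", "c", "c", "d" };

        // Distinct() yields elements in first-occurrence order, so an
        // already-sorted input reaches OrderBy() already in order.
        Console.WriteLine(string.Join(", ", sorted.Distinct())); // a, b, c, d

        // HashSet<T> makes no ordering guarantee; whatever order comes
        // out here is an implementation detail you must not rely on.
        Console.WriteLine(string.Join(", ", new HashSet<string>(sorted)));
    }
}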