Fast ways to avoid duplicates in a List<> in C#

Tags:

My C# program generates random strings from a given pattern. These strings are stored in a list. As no duplicates are allowed I'm doing it like this:

Click to copy

List<string> myList = new List<string>(); for (int i = 0; i < total; i++) {   string random_string = GetRandomString(pattern);   if (!myList.Contains(random_string)) myList.Add(random_string); }

As you can imagine this works fine for several hundreds of entries. But I'm facing the situation to generate several million strings. And with each added string checking for duplicates gets slower and slower.

Are there any faster ways to avoid duplicates?

325

asked Jun 24 '13 14:06

Robert Strauch

1 Answers

Use a data structure that can much more efficiently determine if an item exists, namely a HashSet. It can determine if an item is in the set in constant time, regardless of the number of items in the set.

If you really need the items in a List instead, or you need the items in the resulting list to be in the order they were generated, then you can store the data in both a list and a hashset; adding the item to both collections if it doesn't currently exist in the HashSet.

answered Oct 14 '22 17:10

Servy

Related questions
                            
                                How to create a snk from pfx / cer?
                            
                                Subtracting two dates
                            
                                Similar to Pass in Python for C#
                            
                                Bind dictionary to repeater
                            
                                Group into a dictionary of elements
                            
                                c# working with Entity Framework in a multi threaded server
                            
                                Was C# compiler written in C++?
                            
                                What does the operator "<<" mean in C#?
                            
                                Could not determine a MetaTable
                            
                                How can I check if a Queue is empty?
                            
                                VS2013 Debugger + Entity Framework: "runtime has refused to evaluate the expression", crashes
                            
                                ExpectedException in nUnit gave me an error
                            
                                Could not load file or assembly System.Net.Http version 4.1.1.0
                            
                                How to determine if service has already been added to IServiceCollection
                            
                                How to force a SqlConnection to physically close, while using connection pooling?
                            
                                A Simple C# DLL - how do I call it from Excel, Access, VBA, VB6?
                            
                                The process cannot access the file because it is being used by another process
                            
                                View not updating after post
                            
                                Easiest way to have a program minimize itself to the system tray using .NET 4
                            
                                Create Json dynamically in c#

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Fast ways to avoid duplicates in a List<> in C#

Tags:

c#

list

duplicates

Robert Strauch

People also ask

1 Answers

Servy

Recent Activity

Donate For Us