efficient way to search for string in list of string?

Tags:

I have a list of strings and need to find which strings match a given input value. what is the most efficient way (memory vs execution speed) for me to store this list of strings and be able to search through it? The start-up and loading of the list of strings isnt important, but the response time for searching is.

should i be using a List or HashSet or just a basic string[] or something else?

387

asked Dec 28 '11 15:12

MakkyNZ

3 Answers

It depends very much on the nature of the strings and the size of the collection. Depending on characteristics of the collection, and the expected search strings, there are ways to organize things very cleverly so that searching is very fast. You haven't given us that information.

But here's what I'd do. I'd set a reasonable performance requirement. Then I'd try a n-gram index (why? because you said in a comment you need to account for partial matches; a HashSet<string> won't help you here) and I'd profile reasonable inputs that I expect against this solution and see if it meets my performance requirements or not. If it does, I'd accept the solution and move on. If it doesn't, I'd think very carefully about whether or not my performance requirements are reasonable. If they are, I'd start thinking about whether or not there is something special about my inputs and collection that might enable me to use some more clever solutions.

125

answered Nov 11 '22 01:11

jason

It seems the best way is to build a suffix tree of your input in O(input_len) time then do queries of your patterns in O(pattern_length) time. So if your text is really big compared to your patterns, this will work well.

See Ukkonen's algorithm for building a suffix tree.

If you want inexact matching...see the work of Gonzalo Navarro.

answered Nov 11 '22 02:11

Cris Stringfellow

Use a Dictionary<string>() or an HashSet<string> is probably good for you.

Look here for Dictionary
and here for HashSet

answered Nov 11 '22 03:11

Felice Pollano

Related questions
                            
                                Changing A label text without PostBack (using Update Panels)
                            
                                Generic type from string value
                            
                                LINQ WHERE statement/ignore conditions
                            
                                Can resharper jump to the file that contains the unit tests?
                            
                                Cosine Similarity Code (non-term vectors)
                            
                                Visual Studio Extension for Code Alignment [closed]
                            
                                Trying to learn about the new async features in c#
                            
                                How to subclass UIApplication in Monotouch?
                            
                                number of elements in Tuple<...>
                            
                                Recommended behaviour of GetEnumerator() when implementing IEnumerable<T> and IEnumerator<T>
                            
                                How to add multiple lines of EventData to an EventLog in Windows?
                            
                                What is the equivalent way to set post parameters in .net?
                            
                                Store multi-type OrderBy expression as a property
                            
                                .NET C# - MigraDoc - How to change document charset?
                            
                                C# Open DBF file
                            
                                How to check if a selected row in a datagridview is empty(has no item) C#
                            
                                Not applying the CSS while generating PDF using iTextsharp.dll
                            
                                Looking for a REST with JSON client library [closed]
                            
                                Create firewall rule to open port per application programmatically in c#
                            
                                how to use Invoke method in a file of extensions/methods?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

efficient way to search for string in list of string?

Tags:

memory-management

c#

.net