Find Duplicates in an array in O(N) time

Tags: algorithm

Is there a way to find all the duplicate elements in an array of N elements in O(N) time?

Example:

Input: 11, 29, 81, 14, 43, 43, 81, 29

Output: 29, 81, 43

Sorting the input and doing a linear scan to detect duplicates destroys the order and gives the output: 29,43,81.

Sorting a second array of indices {0, 1, ..., N-1} by the corresponding values in the given array yields the first-occurrence indices of the duplicates, {1,4,2}. Sorting that set of indices to get {1,2,4} then gives {29,81,43}, but this takes O(N log N) time.
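For reference, the index-sorting approach described in the question can be sketched roughly as follows. This is only an illustration in Python; the function name `duplicates_by_sorting` is hypothetical, and the sketch assumes the elements are mutually comparable.

```python
def duplicates_by_sorting(values):
    """Return duplicated values in order of first occurrence, in O(N log N)."""
    # Sort indices by (value, index): equal values become adjacent and the
    # first occurrence leads each run of equal values.
    order = sorted(range(len(values)), key=lambda i: (values[i], i))

    first_indices = []
    run_start = 0
    for pos in range(1, len(order) + 1):
        # A run ends when the value changes or the array is exhausted.
        if pos == len(order) or values[order[pos]] != values[order[run_start]]:
            if pos - run_start > 1:  # the value occurred more than once
                first_indices.append(order[run_start])
            run_start = pos

    # Restore input order by sorting the first-occurrence indices.
    first_indices.sort()
    return [values[i] for i in first_indices]


print(duplicates_by_sorting([11, 29, 81, 14, 43, 43, 81, 29]))
```

Both sorts are comparison sorts, which is where the O(N log N) bound comes from.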

Is there an O(N) algorithm to solve this problem?

P.S. I forgot to add: I don't want to use hash tables. I am looking for a non-hash solution.


asked Oct 01 '11 06:10

alpha_cod

1 Answer

I believe a good solution (decent memory usage, can be used to immediately determine if an entry has already been seen thus preserving order, and with a linear complexity) is a trie.

If you insert each element into the trie as if it were a string, with one digit per node starting from the most significant digit (MSD), you can pull this off with a complexity of O(mN), where m is the average length of the numbers in base-10 digits.

You'd just loop over all your entries and insert them into the trie. Each time an element already exists in the trie, you have found a duplicate and can move on to the next entry. Duplicates here (unlike in my previous answer, which used a radix sort) are found immediately rather than only in the last iteration.
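A minimal sketch of this idea in Python, under a few assumptions that are not in the answer itself: the inputs are non-negative integers, each trie node holds a fixed array of ten child slots (so no hashing is involved), and each terminal node remembers the index of the first occurrence so the output can follow the order asked for in the question. The names `TrieNode` and `duplicates_by_trie` are made up for illustration.

```python
class TrieNode:
    __slots__ = ("children", "first_index")

    def __init__(self):
        self.children = [None] * 10  # one slot per decimal digit, no hashing
        self.first_index = -1        # index of the first value ending here, or -1


def duplicates_by_trie(values):
    """Return duplicated values in order of first occurrence, in O(mN)."""
    root = TrieNode()
    flagged = [False] * len(values)  # flagged[i]: values[i] is repeated later

    for index, value in enumerate(values):
        if value < 0:
            raise ValueError("this sketch assumes non-negative integers")
        node = root
        for ch in str(value):  # digits, most significant first
            digit = ord(ch) - ord("0")
            if node.children[digit] is None:
                node.children[digit] = TrieNode()
            node = node.children[digit]
        if node.first_index >= 0:
            # Seen before: mark the first occurrence as having a duplicate.
            flagged[node.first_index] = True
        else:
            node.first_index = index

    # One more linear pass emits duplicates in first-occurrence order.
    return [v for v, is_dup in zip(values, flagged) if is_dup]


print(duplicates_by_trie([11, 29, 81, 14, 43, 43, 81, 29]))
```

Reporting each duplicate at the moment it is detected would instead list values in the order of their second occurrence (43, 81, 29 for the example input); the `flagged` array is one way to recover the 29, 81, 43 order without sorting.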

I'm not sure if you would benefit from using a suffix tree here, as the "base" of the characters being entered into the trie is only 10 (compared to the base-128 for ANSI strings), but it's possible.


answered Oct 16 '22 12:10

Mahmoud Al-Qudsi
