Efficient algorithm to check duplicate rows in a matrix

Given a matrix M of integers, check whether any two rows of the matrix are identical. Give an optimal approach.

Example:
[{1, 2, 3},
 {3, 4, 5},
 {1, 2, 3}]

In the above matrix, rows 1 and 3 are identical.

Possible Solution:

Given a matrix, we can convert each row into a string (for example, by calling
C++'s to_string() on each element and concatenating the results). We do this
for every row of the matrix and insert each string into a table (something like
map<string, int> in C++). Hence, duplicate rows can be detected in O(mn) time
for an m x n matrix.

Can I do better than this? Or does the above method have any flaw?

asked Oct 17 '13 23:10

Rahul Sharma

1 Answer

Your method works, but your complexity analysis is wrong.

Firstly, testing whether an element is in a std::map has complexity O(log(n) * f), where n is the number of elements in the map and f is an upper bound on the time required to compare any two elements inserted into or searched for in the map.

In your case, take n to be the number of rows and m the number of elements per row. Every string then has length proportional to m (assuming each integer has a bounded number of digits), so comparing any two elements in the map costs O(m).

So the total complexity of your method is:

O(n * log(n) * m) for inserting n strings in the map.
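
To make that bound concrete, here is a minimal sketch of the map-based approach from the question. The function names row_key and has_duplicate_row_map are hypothetical, chosen only for illustration. The comma separator is an addition that is not in the question: plain concatenation without it would map rows such as {1, 23} and {12, 3} to the same string.

```cpp
#include <cstddef>
#include <map>
#include <string>
#include <vector>

// Hypothetical helper: encode a row as a string, writing ',' after each
// element so that {1, 23} and {12, 3} do not produce the same key.
std::string row_key(const std::vector<int>& row) {
    std::string key;
    for (int x : row) {
        key += std::to_string(x);
        key += ',';
    }
    return key;
}

// Returns true if at least two rows of the matrix are identical.
bool has_duplicate_row_map(const std::vector<std::vector<int>>& matrix) {
    // Maps each row key to the index of the first row that produced it.
    std::map<std::string, int> seen;
    for (std::size_t i = 0; i < matrix.size(); ++i) {
        // emplace searches the tree in O(log n) comparisons,
        // and each string comparison costs up to O(m).
        auto result = seen.emplace(row_key(matrix[i]), static_cast<int>(i));
        if (!result.second) {
            return true;  // key already present: duplicate row found
        }
    }
    return false;
}
```

Each of the n insertions performs O(log(n)) string comparisons of cost O(m), which is where the O(n * log(n) * m) total comes from.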

However, you can speed it up to O(n * m) in expectation, which is asymptotically optimal (because you have to read all the data), by using a hash table rather than a map. The reason is that a hash table has O(1) average complexity per insert operation, and the hash of each input string is computed only once, at a cost of O(m).

In C++ you can use std::unordered_set for that.
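
A minimal sketch of the hash-table variant, assuming the same string encoding of rows as in the question. The names encode_row and has_duplicate_row_hashed are hypothetical, and the comma separator is an added assumption to keep different rows from producing the same string.

```cpp
#include <string>
#include <unordered_set>
#include <vector>

// Hypothetical helper: encode a row as a string, writing ',' after each
// element so that different rows cannot concatenate to the same key.
std::string encode_row(const std::vector<int>& row) {
    std::string key;
    for (int x : row) {
        key += std::to_string(x);
        key += ',';
    }
    return key;
}

// Returns true if at least two rows of the matrix are identical.
// Expected time is O(n * m) for n rows of m elements each.
bool has_duplicate_row_hashed(const std::vector<std::vector<int>>& matrix) {
    std::unordered_set<std::string> seen;
    seen.reserve(matrix.size());  // avoid rehashing while inserting
    for (const auto& row : matrix) {
        // insert returns {iterator, bool}; the bool is false
        // when an equal key was already in the set.
        if (!seen.insert(encode_row(row)).second) {
            return true;
        }
    }
    return false;
}
```

The O(1) insert cost is an average-case figure. With many hash collisions a single insert can degrade to time linear in the number of stored keys, so the O(n * m) bound holds in expectation rather than in the worst case.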

answered Nov 11 '22 22:11

pkacprzak

Tags: algorithm, time-complexity, matrix
