Given two sorted arrays: <code>A</code> and <code>B</code>. The size of array <code>A</code> is <code>La</code> and the size of array <code>B</code> is <code>Lb</code>. How to find the intersection of <code>A</code> and <code>B</code>? If <code>La</code> is much bigger than <code>Lb</code>, then will there be any difference for the intersection finding algorithm?

Since this looks like a HW...I'll give you the algorithm: <pre class="prettyprint"><code>Let arr1,arr2 be the two sorted arrays of length La and Lb. Let i be index into the array arr1. Let j be index into the array arr2. Initialize i and j to 0. while(i < La and j < Lb) do if(arr1[i] == arr2[j]) { // found a common element. print arr[i] // print it. increment i // move on. increment j } else if(arr1[i] > arr2[j]) increment j // don't change i, move j. else increment i // don't change j, move i. end while </code></pre>

I've been struggling with same problem for a while now, so far I came with: <ol> <li>Linear matching which will yield O(m+n) in worst case. You basically keep two pointers (A and B) each pointing to beginning of each array. Then advance pointer which points to smaller value, until you reach end of one of arrays, that would indicate no intersection. If at any point you have *A == *B - here comes your intersection.</li> <li>Binary matching. Which yields ~ O(n*log(m)) in worst case. You basically pick smaller array and perform binary search in bigger array of all elements of the smaller array. If you want to be more fancy, you can even use last position where binary search failed and use it as starting point for next binary search. This way you marginally improve worst case but for some sets it might perform miracles :)</li> <li>Double binary matching. It's a variation of regular binary matching. Basically you get element from the middle of smaller array and do binary search in bigger array. If you find nothing then you cut smaller array in half (and yes you can toss element you already used) and cut bigger array in half (use binary search failure point). And then repeat for each pair. Results are better than O(n*log(m)) but I am too lazy to calculate what they are.</li> </ol> Those are two most basic ones. Both have merits. Linear is a bit easier to implement. Binary one is arguably faster (although there are plenty of cases when linear matching will outperform binary). If anyone knows anything better than that I would love to hear it. Matching arrays is what I do these days. P.S. don't quote me on terms "linear matching" and "binary matching" as I made them up myself and there are probably fancy name for it already.

The intersection of two sorted arrays

2 Answers

Since this looks like a HW...I'll give you the algorithm:

Let arr1,arr2 be the two sorted arrays of length La and Lb.
Let i be index into the array arr1.
Let j be index into the array arr2.
Initialize i and j to 0.

while(i < La and j < Lb) do

    if(arr1[i] == arr2[j]) { // found a common element.
        print arr[i] // print it.
        increment i // move on.
        increment j
    }
    else if(arr1[i] > arr2[j])
        increment j // don't change i, move j.
    else
        increment i // don't change j, move i.
end while

116

answered Oct 12 '22 00:10

codaddict

I've been struggling with same problem for a while now, so far I came with:

Linear matching which will yield O(m+n) in worst case. You basically keep two pointers (A and B) each pointing to beginning of each array. Then advance pointer which points to smaller value, until you reach end of one of arrays, that would indicate no intersection. If at any point you have *A == *B - here comes your intersection.
Binary matching. Which yields ~ O(n*log(m)) in worst case. You basically pick smaller array and perform binary search in bigger array of all elements of the smaller array. If you want to be more fancy, you can even use last position where binary search failed and use it as starting point for next binary search. This way you marginally improve worst case but for some sets it might perform miracles :)
Double binary matching. It's a variation of regular binary matching. Basically you get element from the middle of smaller array and do binary search in bigger array. If you find nothing then you cut smaller array in half (and yes you can toss element you already used) and cut bigger array in half (use binary search failure point). And then repeat for each pair. Results are better than O(n*log(m)) but I am too lazy to calculate what they are.

Those are two most basic ones. Both have merits. Linear is a bit easier to implement. Binary one is arguably faster (although there are plenty of cases when linear matching will outperform binary).

If anyone knows anything better than that I would love to hear it. Matching arrays is what I do these days.

P.S. don't quote me on terms "linear matching" and "binary matching" as I made them up myself and there are probably fancy name for it already.

answered Oct 12 '22 00:10

Nazar

Related questions
                            
                                "Multiple definition of" C++ compiler error
                            
                                Are memory leaks "undefined behavior" class problem in C++?
                            
                                Multiple conditions in switch case?
                            
                                C/C++: Pointer Arithmetic
                            
                                What's the 'long' data type used for?
                            
                                Linking errors when compiling code with OpenCV Libraries
                            
                                What is the difference between these (bCondition == NULL) and (NULL==bCondition)?
                            
                                What is the difference between const virtual and virtual const?
                            
                                C++ Builder or Visual Studio for native C++ development?
                            
                                Why or why not should I use 'UL' to specify unsigned long?
                            
                                why can't we declare object of a class inside the same class?
                            
                                boost shared_from_this<>()
                            
                                How to generate a LONG guid?
                            
                                Does GCC support long long int?
                            
                                The Benefits of Using Function Pointers
                            
                                Where should I go after learning C++? [closed]
                            
                                C++ Qt Multiple Definitions
                            
                                Will multi threading provide any performance boost?
                            
                                Adding empty element to declared container without declaring type of element
                            
                                Print binary tree in a pretty way using c++

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

The intersection of two sorted arrays

Tags:

c++

arrays

algorithm

sorting

user288609

People also ask

2 Answers

codaddict

Nazar

Recent Activity

Donate For Us