Elegant way to remove all elements of a vector that are contained in another vector?

Tags:

c++

stl

While looking over some code I found a loopy and algorithmically slow implementation of std::set_difference:

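 for(size_t i = 0; i < a.size(); i++)
 {
  auto iter = std::find(b.begin(),b.end(),a[i]);  // linear search in b for every a[i]
  if(iter != b.end())
  {
     b.erase(iter);  // erasing from the middle of a vector is linear as well
  }
 }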

It can easily be replaced with sort (the vectors are not sorted) + set_difference, but that requires allocating new memory (see my recent question "Can output of set difference be stored in first input?" for why it can't be done in place).
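For reference, the sort + set_difference replacement could look roughly like the sketch below. The helper name `difference_copy` and the `int` element type are assumptions made for illustration, not part of the original code. The result vector is the extra allocation mentioned above.

```cpp
#include <algorithm>
#include <iterator>
#include <vector>

// Hypothetical helper (name and int element type are assumptions):
// returns the elements of b that remain after taking away those in a.
// Both vectors are taken by value because they have to be sorted first.
std::vector<int> difference_copy(std::vector<int> b, std::vector<int> a)
{
    std::sort(b.begin(), b.end());
    std::sort(a.begin(), a.end());

    std::vector<int> out;      // the new memory that set_difference needs
    out.reserve(b.size());     // the difference is never larger than b
    std::set_difference(b.begin(), b.end(), a.begin(), a.end(),
                        std::back_inserter(out));
    return out;
}
```

Like the original loop, set_difference treats duplicates as a multiset: each element of a cancels at most one equal element of b. The output comes back sorted, so the original order of b is lost.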
So my solution would be something like:

sort(a.begin(), a.end());
for(size_t i = 0; i < b.size(); )  // i advances only when b[i] is kept
{
 if (binary_search(a.begin(), a.end(), b[i]))
 {
     swap(b[i], b[b.size()-1]); //remove current element by swapping with last
     b.pop_back();     // and removing new last by shrinking
     // no ++i here: b[i] now holds the former last element, still unchecked
 }
 else
 {
     ++i;
 }
}

Can it be done more elegantly?
Elegant is subjective, so within the scope of this question it is defined as clearer code (ideally something from the STL algorithms, though I think it can't be done) with no memory allocation and no increase in algorithmic complexity.

607

asked Jan 17 '14 20:01

NoSenseEtAl

2 Answers

Sort b so you can binary search it, which reduces the time complexity. Then use the erase-remove idiom to throw away all elements of a that are contained in b:

sort( begin(b), end(b) );
a.erase( remove_if( begin(a),end(a),
    [&](auto x){return binary_search(begin(b),end(b),x);}), end(a) );

Of course, you can still sacrifice time complexity for simplicity and shorten your code by removing the sort() and replacing binary_search() with find():

a.erase( remove_if( begin(a),end(a),
    [&](auto x){return find(begin(b),end(b),x)!=end(b);}), end(a) );

This is a matter of taste. In both cases you don't need heap allocations. By the way, I'm using lambda auto parameters, which are a C++14 feature. Some compilers, such as clang, already implement it. If you only have a C++11 compiler, replace auto with the element type of the container.

By the way, this code does not mention any types! You can write a template function so it works for all kinds of types. The first variant requires random access iteration over b (std::sort needs it), while the second piece of code does not.
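A template wrapper around the first variant might look like the sketch below. The name `remove_all_contained` is made up for illustration. It spells out the element type through `value_type` instead of using an `auto` lambda parameter, so it should also compile as C++11.

```cpp
#include <algorithm>
#include <iterator>
#include <vector>

// Hypothetical name. Removes from a every element that also occurs in b.
// Side effect: b is sorted in place, so it needs random access iterators.
template <typename ContainerA, typename ContainerB>
void remove_all_contained(ContainerA& a, ContainerB& b)
{
    using std::begin;
    using std::end;
    using value_type = typename ContainerA::value_type;

    std::sort(begin(b), end(b));
    a.erase(std::remove_if(begin(a), end(a),
                [&](const value_type& x) {
                    // logarithmic lookup in the sorted b
                    return std::binary_search(begin(b), end(b), x);
                }),
            end(a));
}
```

Called as `remove_all_contained(a, b);`, this keeps the relative order of the surviving elements of a, since remove_if is stable for the elements it keeps.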

103

answered Sep 20 '22 18:09

Ralph Tandetzky

This one does it in O(N+M), assuming both arrays are sorted.

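  auto ib = std::begin(two);
  auto iter = std::remove_if (
       std::begin(one), std::end(one),
       [&](int x) -> bool {   // captures ib and two by reference
                       while  (ib != std::end(two) && *ib < x) ++ib;
                       return (ib != std::end(two) && *ib == x);
                     });
  one.erase(iter, std::end(one));   // drop the leftover tail after remove_if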
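As a self-contained sketch of this merge-style removal, the function below wraps the idea and checks the sortedness assumption with assertions. The name `remove_sorted` and the `int` element type are assumptions for illustration. It also relies on remove_if visiting the elements front to back, which is how the common standard library implementations behave.

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

// Hypothetical wrapper. Both vectors are assumed to be sorted ascending.
// Removes from one every element that also occurs in two, in O(N+M).
void remove_sorted(std::vector<int>& one, const std::vector<int>& two)
{
    assert(std::is_sorted(one.begin(), one.end()));
    assert(std::is_sorted(two.begin(), two.end()));

    auto ib = two.begin();  // cursor into two, only ever moves forward
    auto new_end = std::remove_if(one.begin(), one.end(),
        [&](int x) {
            while (ib != two.end() && *ib < x) ++ib;  // skip smaller elements of two
            return ib != two.end() && *ib == x;       // remove x if two contains it
        });
    one.erase(new_end, one.end());  // shrink one to the kept elements
}
```

Because the cursor stays on a matching element of two, every duplicate of that value in one is removed as well.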

answered Sep 20 '22 18:09

n. 1.8e9-where's-my-share m.
