I have a vector containing few non-adjacent duplicates. As a simple example, consider: <pre class="prettyprint"><code>2 1 6 1 4 6 2 1 1 </code></pre> I am trying to make this <code>vector</code> unique by removing the non-adjacent duplicates and maintaining the order of elements. Result would be: <pre class="prettyprint"><code>2 1 6 4 </code></pre> The solutions I tried are: <ol> <li>Inserting into a std::set but the problem with this approach is that it will disturb the order of elements.</li> <li>Use the combination of std::sort and std::unique. But again same order problem.</li> <li> Manual duplicate elimination: <pre class="prettyprint"><code> Define a temporary vector TempVector. for (each element in a vector) { if (the element does not exists in TempVector) { add to TempVector; } } swap orginial vector with TempVector. </code></pre> </li> </ol> My question is: Is there any STL algorithm which can remove the non-adjacent duplicates from the vector ? what is its complexity?

I think you would do it like this: I would use two iterators on the vector : The first of one reads the data and inserts it a temporary set. When the read data was not in the set you copy it from the first iterator to the second and increment it. At the end you keep only the data up to the second iterator. The complexity is O( n .log( n ) ) as the lookup for duplicated elements uses the set, not the vector. <pre class="prettyprint"><code>#include <vector> #include <set> #include <iostream> int main(int argc, char* argv[]) { std::vector< int > k ; k.push_back( 2 ); k.push_back( 1 ); k.push_back( 6 ); k.push_back( 1 ); k.push_back( 4 ); k.push_back( 6 ); k.push_back( 2 ); k.push_back( 1 ); k.push_back( 1 ); { std::vector< int >::iterator r , w ; std::set< int > tmpset ; for( r = k.begin() , w = k.begin() ; r != k.end() ; ++r ) { if( tmpset.insert( *r ).second ) { *w++ = *r ; } } k.erase( w , k.end() ); } { std::vector< int >::iterator r ; for( r = k.begin() ; r != k.end() ; ++r ) { std::cout << *r << std::endl ; } } } </code></pre>

Without using a temporary <code>set</code> it's possible to do this with (possibly) some loss of performance: <pre class="prettyprint"><code>template<class Iterator> Iterator Unique(Iterator first, Iterator last) { while (first != last) { Iterator next(first); last = std::remove(++next, last, *first); first = next; } return last; } </code></pre> used as in: <pre class="prettyprint"><code>vec.erase( Unique( vec.begin(), vec.end() ), vec.end() ); </code></pre> For smaller data sets, the implementation simplicity and lack of extra allocation required may offset the theoretical higher complexity of using an additional <code>set</code>. Measurement with a representative input is the only way to be sure, though.

How to make elements of vector unique? (remove non adjacent duplicates)

Tags:

c++

unique

stl

vector

I have a vector containing few non-adjacent duplicates.

As a simple example, consider:

2 1 6 1 4 6 2 1 1

I am trying to make this vector unique by removing the non-adjacent duplicates and maintaining the order of elements.

Result would be:

2 1 6 4

The solutions I tried are:

Inserting into a std::set but the problem with this approach is that it will disturb the order of elements.
Use the combination of std::sort and std::unique. But again same order problem.

Manual duplicate elimination:

    Define a temporary vector TempVector.     for (each element in a vector)     {         if (the element does not exists in TempVector)         {             add to TempVector;         }     }     swap orginial vector with TempVector.

My question is:

Is there any STL algorithm which can remove the non-adjacent duplicates from the vector ? what is its complexity?

253

asked Sep 21 '09 07:09

aJ.

2 Answers

I think you would do it like this:

I would use two iterators on the vector :

The first of one reads the data and inserts it a temporary set.

When the read data was not in the set you copy it from the first iterator to the second and increment it.

At the end you keep only the data up to the second iterator.

The complexity is O( n .log( n ) ) as the lookup for duplicated elements uses the set, not the vector.

#include <vector> #include <set> #include <iostream>  int main(int argc, char* argv[]) {     std::vector< int > k ;      k.push_back( 2 );     k.push_back( 1 );     k.push_back( 6 );     k.push_back( 1 );     k.push_back( 4 );     k.push_back( 6 );     k.push_back( 2 );     k.push_back( 1 );     k.push_back( 1 );  {     std::vector< int >::iterator r , w ;      std::set< int > tmpset ;      for( r = k.begin() , w = k.begin() ; r != k.end() ; ++r )     {         if( tmpset.insert( *r ).second )         {             *w++ = *r ;         }     }      k.erase( w , k.end() ); }       {         std::vector< int >::iterator r ;          for( r = k.begin() ; r != k.end() ; ++r )         {             std::cout << *r << std::endl ;         }     } }

105

answered Sep 20 '22 22:09

fa.

Without using a temporary set it's possible to do this with (possibly) some loss of performance:

template<class Iterator> Iterator Unique(Iterator first, Iterator last) {     while (first != last)     {         Iterator next(first);         last = std::remove(++next, last, *first);         first = next;     }      return last; }

used as in:

vec.erase( Unique( vec.begin(), vec.end() ), vec.end() );

For smaller data sets, the implementation simplicity and lack of extra allocation required may offset the theoretical higher complexity of using an additional set. Measurement with a representative input is the only way to be sure, though.

answered Sep 20 '22 22:09

CB Bailey

Related questions
                            
                                Is the typedef-name optional in a typedef declaration?
                            
                                Why is std::atomic<bool> much slower than volatile bool?
                            
                                Why is a constructor necessary in a const member struct?
                            
                                Class template argument deduction not working with alias template
                            
                                Difference between sizeof(empty struct) and sizeof(struct with empty array)?
                            
                                Lua vs Embedded Lisp and potential other candidates. for set based data processing
                            
                                Does using std::array<T, N> lead to code bloat? [duplicate]
                            
                                How does the computer calculate Square roots? [closed]
                            
                                How to get protobuf enum as string?
                            
                                Why does this function call behave sensibly after calling it through a typecasted function pointer?
                            
                                C++ "Named Parameter Idiom" vs. Boost::Parameter library
                            
                                How to handle or avoid a stack overflow in C++
                            
                                C++ GDB Python Pretty Printing Tutorial?
                            
                                What is the meaning of empty "<>" in template usage?
                            
                                What is the order of destruction of function arguments?
                            
                                Why is name mangling not standardized
                            
                                Which <type_traits> cannot be implemented without compiler hooks?
                            
                                Why base class destructor (virtual) is called when a derived class object is deleted?
                            
                                Returning a c++ std::vector without a copy?
                            
                                What is assignment via curly braces called? and can it be controlled?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With