Most efficient way to split a vector into several

Tags: c++, vector, c++98

I have the following code, which breaks up vectorOfInterest into smaller blocks to send out. The code works.

However, I make a copy when I split vectorOfInterest into smaller chunks (in the constructors of subList and remainder). Is there a better way that moves the data instead of duplicating it again, for better performance?

Note that I cannot change the argument of OTHERCLASS::doSend().

Edit: I am using C++98.

int blockSize = 50;
vector<CLASS_T> vectorOfInterest;

// ...<populates vectorOfInterest>
do {
    if (vectorOfInterest.size() > blockSize) {
        vector<CLASS_T>::iterator from = vectorOfInterest.begin();
        vector<CLASS_T>::iterator to = from + blockSize;

        //elements are copied again in subList and remainder
        //I like to move the elements from vectorOfInterest instead.
        vector<CLASS_T> subList(from, to);
        vector<CLASS_T> remainder(to, vectorOfInterest.end());
        vectorOfInterest.swap(remainder);

        OTHERCLASS::doSend(subList); // method which sends sublists in blocks of exactly 50 to external library
    } else {
        //pad to exactly size 50
        vectorOfInterest.resize(blockSize);

        OTHERCLASS::doSend(vectorOfInterest); // method which sends sublists in blocks of exactly 50 to external library

        vectorOfInterest.clear();
    }
} while (!vectorOfInterest.empty());

asked Aug 27 '14 06:08

Angel Koh

3 Answers

You shouldn't be erasing elements from vectorOfInterest every iteration. That involves a lot of unnecessary copying. Instead, keep a persistent iterator. You can also avoid doing an allocation of the sublist every iteration.

vector<CLASS_T>::iterator from = vectorOfInterest.begin();
vector<CLASS_T> subList;  // one buffer, reused for every block

do {
    if (vectorOfInterest.end() - from > blockSize) {
        // full block: copy it once and advance the iterator
        subList.assign(from, from + blockSize);
        from += blockSize;
        OTHERCLASS::doSend(subList);
    } else {
        // last block: copy what is left and pad to exactly blockSize
        subList.assign(from, vectorOfInterest.end());
        subList.resize(blockSize);
        OTHERCLASS::doSend(subList);
        vectorOfInterest.clear();  // ends the loop; from is not used again
        subList.clear();
    }
} while (!vectorOfInterest.empty());
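
For anyone who wants to try the persistent-iterator idea in isolation, here is a minimal self-contained sketch in C++98. The names sendInBlocks and RecordingSender are made up for illustration. RecordingSender only stands in for OTHERCLASS, whose real doSend signature is not shown in the question. Unlike the do/while loop above, this version sends nothing for an empty input and leaves the source vector untouched.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for the external library: it just records each block.
struct RecordingSender {
    std::vector<std::vector<int> > blocks;
    void doSend(std::vector<int>& block) { blocks.push_back(block); }
};

// Sends `data` in blocks of exactly `blockSize` elements, padding the last
// block with value-initialised elements. `data` itself is never modified.
template <typename T, typename Sender>
void sendInBlocks(const std::vector<T>& data, std::size_t blockSize, Sender& sender)
{
    typedef typename std::vector<T>::const_iterator Iter;
    typedef typename std::vector<T>::difference_type Diff;

    assert(blockSize > 0);
    std::vector<T> subList;      // one buffer, reused for every block
    subList.reserve(blockSize);  // single allocation up front

    Iter from = data.begin();
    while (from != data.end()) {
        Diff remaining = data.end() - from;
        Diff count = remaining < Diff(blockSize) ? remaining : Diff(blockSize);
        subList.assign(from, from + count);  // each element is copied exactly once
        subList.resize(blockSize);           // pads only when count < blockSize
        sender.doSend(subList);
        from += count;
    }
}
```

With seven ints and a block size of 3, this should produce three blocks of three elements each, the last one padded with two zeros.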

answered Oct 12 '22 22:10

Benjamin Lindley

If you do not need a real copy, and the original vector stays unchanged and accessible while the library uses the data, you could pass appropriate iterator ranges (boost::iterator_range<std::vector<CLASS_T>::iterator>) to the library instead. This approach assumes that the other class accepts a template argument that behaves like a container.
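
To illustrate the iterator-range idea without depending on Boost, here is a rough C++98 sketch. IterRange is a hand-written stand-in for boost::iterator_range, and splitIntoRanges and sumRange are hypothetical names. None of this is the library's API. Note that a non-owning view cannot be padded, so the last range may be shorter than the block size, and the approach only helps if the receiving function can be templated, which the question says is not the case for doSend.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Minimal stand-in for boost::iterator_range: a non-owning [first, last) view.
template <typename Iter>
struct IterRange {
    Iter first;
    Iter last;
    IterRange(Iter f, Iter l) : first(f), last(l) {}
    Iter begin() const { return first; }
    Iter end() const { return last; }
    std::size_t size() const { return static_cast<std::size_t>(last - first); }
};

// Hypothetical receiver: works on a view and never copies the elements.
template <typename Iter>
long sumRange(const IterRange<Iter>& r)
{
    long total = 0;
    for (Iter it = r.begin(); it != r.end(); ++it) total += *it;
    return total;
}

// Splits `data` into views of at most `blockSize` elements each.
// The views stay valid only while `data` is alive and not resized.
template <typename T>
std::vector<IterRange<typename std::vector<T>::const_iterator> >
splitIntoRanges(const std::vector<T>& data, std::size_t blockSize)
{
    typedef typename std::vector<T>::const_iterator Iter;
    typedef typename std::vector<T>::difference_type Diff;

    assert(blockSize > 0);
    std::vector<IterRange<Iter> > ranges;
    Iter from = data.begin();
    while (from != data.end()) {
        Diff remaining = data.end() - from;
        Diff count = remaining < Diff(blockSize) ? remaining : Diff(blockSize);
        ranges.push_back(IterRange<Iter>(from, from + count));
        from += count;
    }
    return ranges;
}
```

Each IterRange holds just two iterators, so building the list of blocks costs no element copies at all.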

answered Oct 13 '22 00:10

SebastianK

A more important optimisation might be to avoid the excessive copying done when you create remainder.

Try this:

int blockSize = 50;
vector<CLASS_T> vectorOfInterest;
// ...<populates vectorOfInterest>
vector<CLASS_T> subList(blockSize);  // allocated once, reused for every block

size_t vectorSize = vectorOfInterest.size();

for (size_t i = 0; i < vectorSize; i += blockSize)
{
    vector<CLASS_T>::iterator from = vectorOfInterest.begin() + i;
    size_t blockEnd = std::min(i + blockSize, vectorSize);  // index one past this block
    vector<CLASS_T>::iterator to = vectorOfInterest.begin() + blockEnd;

    //replace this with an elementwise swap if you can
    vector<CLASS_T>::iterator padFrom = std::copy(from, to, subList.begin());
    //pad a short final block up to blockSize
    std::fill(padFrom, subList.end(), CLASS_T() /* or whatever padding value you want */);

    OTHERCLASS::doSend(subList);
}

Edit: I just saw the C++98 part. Moving is out of the question. In other words, you are allocating a new vector twice in your loop and copying each element twice. By preallocating a vector above the loop, you allocate only one vector (the one that doSend receives). Avoiding the second copy of each element is harder. Since you are allowed to invalidate the vector's elements, swapping and then copying is possible.
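
As a rough illustration of the elementwise-swap idea, the sketch below uses std::swap_ranges from <algorithm> (available in C++98) to exchange elements between the source vector and the reusable buffer. It assumes the element type has a cheap swap, as std::string typically does, and that the source vector may be destroyed in the process. sendInBlocksBySwap and RecordingStringSender are invented names, and the sender is only a stand-in for the real library.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical stand-in for the external library: it just records each block.
struct RecordingStringSender {
    std::vector<std::vector<std::string> > blocks;
    void doSend(std::vector<std::string>& block) { blocks.push_back(block); }
};

// Sends `data` in blocks of exactly `blockSize` elements by swapping elements
// into a reusable buffer instead of copying them. `data` is emptied at the end.
template <typename T, typename Sender>
void sendInBlocksBySwap(std::vector<T>& data, std::size_t blockSize, Sender& sender)
{
    typedef typename std::vector<T>::iterator Iter;
    typedef typename std::vector<T>::difference_type Diff;

    assert(blockSize > 0);
    std::vector<T> subList(blockSize);  // allocated once, reused for every block

    Iter from = data.begin();
    while (from != data.end()) {
        Diff remaining = data.end() - from;
        Diff count = remaining < Diff(blockSize) ? remaining : Diff(blockSize);

        // Exchange this block with the buffer; `data` receives the buffer's
        // stale contents, which are never read again.
        Iter padFrom = std::swap_ranges(from, from + count, subList.begin());

        // Overwrite whatever is left in the buffer when the block is short.
        std::fill(padFrom, subList.end(), T());

        sender.doSend(subList);
        from += count;
    }
    data.clear();  // only stale, swapped-out values remain
}
```

Whether this beats a plain copy depends on the element type: for types without a cheap swap (for example plain structs of ints), swapping is likely no faster than copying.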

answered Oct 13 '22 00:10

user3125280
