How to fill memory fast with a `int32_t` value?

Tags:

Is there a function (SSEx intrinsics is OK) which will fill the memory with a specified int32_t value? For instance, when this value is equal to 0xAABBCC00 the result memory should look like:

AABBCC00AABBCC00AABBCC00AABBCC00AABBCC00
AABBCC00AABBCC00AABBCC00AABBCC00AABBCC00
AABBCC00AABBCC00AABBCC00AABBCC00AABBCC00
AABBCC00AABBCC00AABBCC00AABBCC00AABBCC00
...

I could use std::fill or simple for-loop, but it is not fast enough.

Resizing of a vector performed only once in the beginning of program, this is not an issue. The bottleneck is filling the memory.

Simplified code:

struct X
{
  typedef std::vector<int32_t> int_vec_t;
  int_vec_t buffer;

  X() : buffer( 5000000 ) { /* some more action */ }
  ~X() { /* some code here */ }

  // the following function is called 25 times per second
  const int_vec_t& process( int32_t background, const SOME_DATA& data );
};

const X::int_vec_t& X::process( int32_t background, const SOME_DATA& data )
{
    // the following one string takes 30% of total time of #process function
    std::fill( buffer.begin(), buffer.end(), background );

    // some processing
    // ...

    return buffer;
}

564

asked Jul 09 '10 12:07

Kirill V. Lyadvinsky

1 Answers

This is how I would do it (please excuse the Microsoft-ness of it):

VOID FillInt32(__out PLONG M, __in LONG Fill, __in ULONG Count)
{
    __m128i f;

    // Fix mis-alignment.
    if ((ULONG_PTR)M & 0xf)
    {
        switch ((ULONG_PTR)M & 0xf)
        {
            case 0x4: if (Count >= 1) { *M++ = Fill; Count--; }
            case 0x8: if (Count >= 1) { *M++ = Fill; Count--; }
            case 0xc: if (Count >= 1) { *M++ = Fill; Count--; }
        }
    }

    f.m128i_i32[0] = Fill;
    f.m128i_i32[1] = Fill;
    f.m128i_i32[2] = Fill;
    f.m128i_i32[3] = Fill;

    while (Count >= 4)
    {
        _mm_store_si128((__m128i *)M, f);
        M += 4;
        Count -= 4;
    }

    // Fill remaining LONGs.
    switch (Count & 0x3)
    {
        case 0x3: *M++ = Fill;
        case 0x2: *M++ = Fill;
        case 0x1: *M++ = Fill;
    }
}

163

answered Sep 30 '22 20:09

wj32

Related questions
                            
                                Does span propagate const?
                            
                                Difference between std::resize(n) and std::shrink_to_fit in C++?
                            
                                Why does C++20's requires expression not behave as expected?
                            
                                Virtual functions in constructors, why do languages differ?
                            
                                C++ function pointers and classes
                            
                                How to debug code that uses boost w/o losing sanity?
                            
                                What is the purpose of Browse Information generated by Visual Studio
                            
                                Checking if a registry key exists
                            
                                What's the point of _MERGE_PROXYSTUB?
                            
                                How do I start to use multithread programming?
                            
                                C++: Is there a way to define a static array inline?
                            
                                Overloading the global type conversion operator
                            
                                C or C++ for OpenGL graphics
                            
                                BOOST program_options: parsing multiple argument list
                            
                                Simplest way to get current time in current timezone using boost::date_time?
                            
                                forward declare static function c++
                            
                                Writing my own implementation of stl-like Iterator in C++
                            
                                the "new" operator in c++, pointer question
                            
                                C++ code generation with Python
                            
                                cmath compilation error when compiling old C++ code in VS2010

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to fill memory fast with a `int32_t` value?

Tags:

c++

memory

windows

visual-c++

winapi

Kirill V. Lyadvinsky

People also ask

1 Answers

wj32

Recent Activity

Donate For Us