Is this a good reason to use alloca?

Tags:

alloca

I have the following function:

double 
neville (double xx, size_t n, const double *x, const double *y, double *work);

which performs Lagrange interpolation at xx using the n points stored in x and y. The work array has size 2 * n. Since this is polynomial interpolation, n is in the ballpark of ~5, very rarely more than 10.

This function is aggressively optimized, and supposed to be called in tight loops. Profiling suggests that heap allocating the work array in the loop is bad. Unfortunately, I'm supposed to package this into a function-like class, and clients must be unaware of the work array.

For now, I use a template integer argument for the degree and std::array to avoid dynamic allocation of the work array:

Click to copy

template <size_t n>
struct interpolator
{
    double operator() (double xx) const
    {
        std::array<double, 2 * n> work;
        size_t i = locate (xx); // not shown here, no performance impact
                                // due to clever tricks + nice calling patterns

        return neville (xx, n, x + i, y + i, work.data ());
    }        

    const double *x, *y;
};

It would have been possible to store the work array as a mutable member of the class, but operator() is supposed to be used concurrently by several threads. This version is OK provided you know n at compile time.

Now, I need the n parameter to be specified at run time. I am wondering about something like this:

Click to copy

double operator() (double xx) const
{
    auto work = static_cast<double*> (alloca (n * sizeof (double)));
    ...

Some bells ring when using alloca: I'm of course going to have a cap on n to avoid the alloca call to overflow (anyway it's quite stupid to use degree 100 polynomial interpolation).

I'm quite unconfortable with the approach however:

Am I missing some obvious danger of alloca ?
Is there a better way to avoid heap allocation here ?

607

asked Apr 30 '13 18:04

Alexandre C.

1 Answers

I'm quite unconfortable with the approach however:

Am I missing some obvious danger of alloca ?

You pointed the one real danger out: stack overflow behaviour is undefined for alloca. In addition, alloca isn’t actually standardised. For instance, Visual C++ has _alloca instead, and GCC by default defines it as a macro. That problem can be circumvented fairly easily, however, by providing a thin wrapper around the few existing implementations.

Is there a better way to avoid heap allocation here ?

Not really. C++14 will have a (potentially!) stack allocated variable-length array type. But until then, and when you consider std::array not to be a good fit, go for alloca in cases such as yours.

Minor nitpick though: your code is missing a cast of the return value of alloca. It shouldn’t even compile.

answered Oct 08 '22 18:10

Konrad Rudolph

Related questions
                            
                                Add an item in a container of smart pointers
                            
                                Is it possible to run unmanaged C++ normally from a managed C++/CLI project?
                            
                                std::move vs. compiler optimization
                            
                                Enumeration types in Node.js native addon
                            
                                templating virtual functions not possible. Only a temporary technical limitation?
                            
                                Why is a single thread faster than multiple threads even though they essentially have the same overhead?
                            
                                Get process ID by name
                            
                                How to get this pointer from std::function?
                            
                                how to build c++ project with scons 2.3 visual express 2012?
                            
                                What is the size of each element in std::list?
                            
                                Derived exception does not inherit constructors
                            
                                #include <iostream> in multiple files
                            
                                How impacting is "Packing" structures on performance
                            
                                Project Euler #23, can't find the issue in program
                            
                                Create thread within DLL
                            
                                get all controls by FindWindowEx
                            
                                Why it didn't need link libm?
                            
                                msvcprtd.lib(MSVCP100D.dll) : fatal error LNK1112: module machine type 'X86' conflicts with target machine type 'x64'
                            
                                Boost in Netbeans 7.1.1
                            
                                Using push_back on a vector<vector<string> > [closed]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With