Is there any workaround to "reserve" a cache fraction?

Tags: c++, c, memory-management, optimization, caching

Assume I have to write a computationally intensive C or C++ function that takes 2 arrays as input and produces one array as output. If the computation reads the 2 input arrays more often than it updates the output array, I'll end up in a situation where the output array is seldom in the cache, because its lines get evicted to make room for the 2 input arrays.

I want to reserve a fraction of the cache for the output array and somehow enforce that its lines don't get evicted once they are fetched, so that partial results are always written to the cache.

Update1(output[]); // Output gets cached
DoCompute1(input1[]); // Input 1 gets cached
DoCompute2(input2[]); // Input 2 gets cached
Update2(output[]); // Output is not in the cache anymore and has to get cached again
...

I know there are mechanisms to help eviction: clflush, clevict, _mm_clevict, etc. Are there any mechanisms for the opposite?

I am thinking of 3 possible solutions:

1. Using _mm_prefetch from time to time to fetch the data back if it has been evicted. However, this might generate unnecessary traffic, and I would need to be very careful about where to place the prefetches;
2. Processing the data in smaller chunks. However, this works only if the problem allows it;
3. Disabling the hardware prefetchers, where possible, to reduce the rate of unwanted evictions.
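To make the second idea (smaller chunks) concrete, here is a minimal sketch. The kernel, the function name update_in_chunks and the chunk parameter are all hypothetical and chosen only for illustration: each output element accumulates in1[k] * in2[k * n + i] over several passes. Instead of sweeping the whole output once per pass, the loop finishes every pass for one chunk of the output before moving on, so that chunk has a good chance of staying cached. Whether this applies depends on the real algorithm.

```cpp
#include <algorithm>
#include <cstddef>

// Hypothetical kernel: out[i] += in1[k] * in2[k * n + i] for every k < passes.
// The output is processed one chunk at a time; all passes over the inputs are
// completed for a chunk before the next chunk is touched.
void update_in_chunks(const float* in1, const float* in2, float* out,
                      std::size_t n, std::size_t passes, std::size_t chunk)
{
    if (chunk == 0) {
        chunk = n;  // assumption: treat 0 as "no chunking"
    }
    for (std::size_t begin = 0; begin < n; begin += chunk) {
        const std::size_t end = std::min(begin + chunk, n);
        for (std::size_t k = 0; k < passes; ++k) {
            const float scale = in1[k];
            const float* row = in2 + k * n;
            // Only out[begin..end) is written here, so these lines are
            // reused across all passes before the loop moves on.
            for (std::size_t i = begin; i < end; ++i) {
                out[i] += scale * row[i];
            }
        }
    }
}
```

The chunk size would have to be tuned so that one chunk of the output plus the matching slices of the inputs fit in the cache level of interest; that value is machine dependent and is not something this sketch can determine.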

Other than that, is there any elegant solution?

asked Apr 26 '15 18:04

VAndrei

1 Answer

Intel CPUs have something called No Eviction Mode (NEM), but I doubt this is what you need.

While you are trying to optimise away the second (unnecessary) fetch of output[], have you considered keeping your intermediate output values in SSE2/3/4 registers, updating them there as needed, and writing them back only once all updates to that part of output[] are done? I did something similar when computing FFTs (Fast Fourier Transforms): part of the output lives in registers and is moved out to memory only when it is known that it will not be accessed again. Until then, all updates happen in the registers. You'll need compiler intrinsics or inline assembly to use the SSE* registers effectively. Of course, such optimisations depend heavily on the nature of the algorithm and on data placement.
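As a rough illustration of the register approach, the sketch below uses SSE intrinsics, which the compiler maps to XMM registers. The kernel and the name accumulate_in_registers are hypothetical: each output element accumulates in1[k] * in2[k * n + i] over several passes. Four output values are loaded once, updated in a register across all passes, and stored once. The preprocessor guard is an assumption added so that the code still builds on non-x86 targets, where only the scalar loop is used.

```cpp
#include <cstddef>

#if defined(__SSE__) || defined(_M_X64) || defined(_M_IX86)
#include <xmmintrin.h>  // SSE intrinsics (x86 only)
#define SKETCH_HAVE_SSE 1
#endif

// Hypothetical kernel: out[i] += in1[k] * in2[k * n + i] for every k < passes.
void accumulate_in_registers(const float* in1, const float* in2, float* out,
                             std::size_t n, std::size_t passes)
{
    std::size_t i = 0;
#ifdef SKETCH_HAVE_SSE
    for (; i + 4 <= n; i += 4) {
        __m128 acc = _mm_loadu_ps(out + i);            // read 4 outputs once
        for (std::size_t k = 0; k < passes; ++k) {
            const __m128 scale = _mm_set1_ps(in1[k]);  // broadcast in1[k]
            const __m128 row = _mm_loadu_ps(in2 + k * n + i);
            acc = _mm_add_ps(acc, _mm_mul_ps(scale, row));  // update in register
        }
        _mm_storeu_ps(out + i, acc);                   // write 4 outputs once
    }
#endif
    // Scalar loop for the remaining elements (or all of them without SSE).
    // A local accumulator is typically kept in a register by the compiler.
    for (; i < n; ++i) {
        float acc = out[i];
        for (std::size_t k = 0; k < passes; ++k) {
            acc += in1[k] * in2[k * n + i];
        }
        out[i] = acc;
    }
}
```

With this loop order the output array is read and written exactly once per element, no matter how many passes are made over the inputs, so there is no second fetch of output[] to worry about. It only works when all updates to a given output element can be grouped together, which is the same restriction the FFT example relies on.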

answered Nov 15 '22 17:11

pavan
