It is possible for an operating system to determine whether a page of memory is in DRAM or in swap; for example, simply try to access it and if a page fault occurs, it wasn't. However, is the same thing possible with CPU cache? Is there any efficient way to tell whether a given memory location has been loaded into a cache line, or to know when it does so?

If you try to determine this yourself then the very act of running your program could invalidate the relevant cache lines, hence rendering your measurements useless. This is one of those cases that mirrors the scientific principle that you cannot measure something without affecting that which you are measuring.

Determine whether memory location is in CPU cache

2 Answers

In general, I don't think this is possible. It works for DRAM and the pagefile since that is an OS managed resource, cache is managed by the CPU itself.

The OS could do a tight timing loop of a memory read and try to see if it completes fast enough to be in the cache or if it had to go out to main memory - this would be very error prone.

On multi-core/multi-proc systems, there are cache coherency protocols that are used between processors to determine when to they need to invalidate each other's caches, I suppose you could have a custom device that would snoop this protocol that the OS would query.

What are you trying to do? If you want to force something into memory, current x86 processors support prefetching memory into the cache in a non-blocking way, for instance with Visual C++ you could use _mm_prefetch to fetch a line into the cache.

EDIT: I haven't done this myself, so use at your own risk. To determine cache misses for profiling, you may be able to use some architecture-specific registers. http://download.intel.com/design/processor/manuals/253669.pdf, Appendix A gives "Performance Tuning Events". This can't be used to determine if an individual address is in the cache or when it is loaded in the cache, but can be used for overall stats. I believe this is what vTune (a phenomenal profiler for this level) uses.

130

answered Oct 20 '22 23:10

Michael

If you try to determine this yourself then the very act of running your program could invalidate the relevant cache lines, hence rendering your measurements useless.

This is one of those cases that mirrors the scientific principle that you cannot measure something without affecting that which you are measuring.

answered Oct 20 '22 23:10

Alnitak

Related questions
                            
                                How does copy-on-write in fork() handle multiple fork?
                            
                                How are the "money" and "decimal" data types in SQL Server stored in memory?
                            
                                what is the optimal chunksize in pandas read_csv to maximize speed?
                            
                                Is a processes memory reclaimed when it terminates?
                            
                                Linux Core Dump Without Killing Process
                            
                                How to decide between SQLite database vs. in-memory usage
                            
                                How can I load large files (~150MB) in MATLAB?
                            
                                Using multiple allocators efficiently
                            
                                Why Enable/Disable A20 Line
                            
                                The memory-efficient way of using Core Image on iOS?
                            
                                Problems with Java garbage collector and memory
                            
                                Apache crashes with munmap_chunk(): invalid pointer after update to php7 on Jessie
                            
                                How is its lifetime of a return value extended to the scope of the calling function when it is bound to a const reference in the calling function?
                            
                                Read a tiff file's dimension and resolution without loading it first
                            
                                Memory profiler for C [closed]
                            
                                Desperately seeking the answer to my pointer problem
                            
                                CUDA texture memory space
                            
                                Python high memory usage with BeautifulSoup
                            
                                MySQL user created temporary table is full
                            
                                How are different types stored in memory

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Determine whether memory location is in CPU cache

Tags:

memory

caching

assembly

cpu

fault

Mike A

People also ask

2 Answers

Michael

Alnitak

Recent Activity

Donate For Us