What causes an L3 cache miss in a CPU?

I have a question about the relation between the cache misses of different cache levels on an x86 architecture (say, a Xeon X5660).

I profiled an OpenCL application (Blackscholes) using several performance counters. For each counter, I summed the values over all cores and got this result:

 instructions #: 493167746502.000000 

 L3_MISS #: 1967809.000000 

 L1_MISS  #: 2344383795.000000 

 L2_DATA_MISS #: 901131.000000 

 L2_MISS #: 1397931.000000 

 memory loads #: 151559373227.000000

My question is: why is the number of L3 misses larger than the number of L2 misses? (I have rerun the profiling many times, and the variance is not significant.) My basic assumption was:

L2 misses = L3 hits + L3 misses
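
To make the mismatch concrete, here is a minimal sketch that checks the counter values above against that expectation. The dictionary keys simply reuse the counter names from the output, and the helper `check_hierarchy` is a hypothetical name introduced only for illustration.

```python
# Counter values copied from the profiling output above (summed over all cores).
counters = {
    "L1_MISS": 2344383795,
    "L2_MISS": 1397931,
    "L2_DATA_MISS": 901131,
    "L3_MISS": 1967809,
}

def check_hierarchy(c):
    """Return the expectations that the counters violate, assuming each
    level can only miss on requests that already missed the level above."""
    violations = []
    if c["L2_MISS"] > c["L1_MISS"]:
        violations.append("L2_MISS > L1_MISS")
    if c["L3_MISS"] > c["L2_MISS"]:
        violations.append("L3_MISS > L2_MISS")
    return violations

violations = check_hierarchy(counters)

# Under "L2 misses = L3 hits + L3 misses", the implied number of L3 hits
# would have to be negative, which is impossible.
implied_l3_hits = counters["L2_MISS"] - counters["L3_MISS"]

print(violations)
print(implied_l3_hits)
```

The only violated expectation is the L3 versus L2 one, so the L1 and L2 numbers are at least consistent with each other.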

Could someone explain what goes wrong here? Did I miss something?

Taking this a bit further: what causes a read of the CPU's last-level cache? Is it simply a data miss in L2?

Thanks

asked May 02 '12 13:05

Zk1001

1 Answer

The 32 nanometer, six core Westmere-EP chip

Image Ref : http://www.theregister.co.uk/2010/02/03/intel_westmere_ep_preview/

As you can see above, in the Westmere-EP architecture each block of 3 cores shares a section of the L3 cache. So what "boiler96" says makes sense: either your L2 miss count covers only an individual core, or your L3 miss count comes from the uncore, which reports the combined misses from all cores.
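
A small sketch of the first case, using made-up numbers rather than measured data: if the L2 count covers one core while the L3 count is socket-wide, L3 misses can look larger than L2 misses even though the hierarchy is consistent. The names `per_core_l2_miss`, `uncore_l3_miss` and `compare` are assumptions for illustration only.

```python
# Hypothetical per-core L2 miss counts for a six-core socket (not measured data).
per_core_l2_miss = [100, 120, 90, 110, 95, 105]

# Hypothetical socket-wide L3 miss count, as a shared (uncore) counter
# would report it for all cores combined.
uncore_l3_miss = 400

def compare(l2_miss, l3_miss):
    """Describe how an L3 miss count relates to an L2 miss count."""
    return "L3 > L2" if l3_miss > l2_miss else "L3 <= L2"

# Comparing one core's L2 misses with the socket-wide L3 count
# gives the apparent anomaly.
single_core_view = compare(per_core_l2_miss[0], uncore_l3_miss)

# Comparing like with like (L2 summed over all cores) removes it.
all_cores_view = compare(sum(per_core_l2_miss), uncore_l3_miss)

print(single_core_view)
print(all_cores_view)
```

So it is worth checking which scope each counter actually covers before comparing them.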

answered Sep 18 '22 06:09

dvishal

Tags: memory, caching, profiling, cpu, opencl
