I am taking a System Architecture course and I have trouble understanding how a direct mapped cache works. I have looked in several places and they explain it in a different manner which gets me even more confused. What I cannot understand is what is the Tag and Index, and how are they selected? The explanation from my lecture is: "Address divided is into two parts index (e.g 15 bits) used to address (32k) RAMs directly Rest of address, tag is stored and compared with incoming tag. " Where does that tag come from? It cannot be the full address of the memory location in RAM since it renders direct mapped cache useless (when compared with the fully associative cache). Thank you very much.

Okay. So let's first understand how the CPU interacts with the cache. There are three layers of memory (broadly speaking) - <code>cache</code> (generally made of <code>SRAM</code> chips), <code>main memory</code> (generally made of <code>DRAM</code> chips), and <code>storage</code> (generally magnetic, like hard disks). Whenever CPU needs any data from some particular location, it first searches the cache to see if it is there. Cache memory lies closest to the CPU in terms of memory hierarchy, hence its access time is the least (and cost is the highest), so if the data CPU is looking for can be found there, it constitutes a 'hit', and data is obtained from there for use by CPU. If it is not there, then the data has to be moved from the main memory to the cache before it can be accessed by the CPU (CPU generally interacts only with the cache), that incurs a time penalty. So to find out whether the data is there or not in the cache, various algorithms are applied. One is this direct mapped cache method. For simplicity, let's assume a memory system where there are 10 cache memory locations available (numbered 0 to 9), and 40 main memory locations available (numbered 0 to 39). This picture sums it up: <img src="https://i.stack.imgur.com/koS91.png" alt="enter image description here"> There are 40 main memory locations available, but only upto 10 can be accommodated in the cache. So now, by some means, the incoming request from CPU needs to be redirected to a cache location. That has two problems: <ol> <li> How to redirect? Specifically, how to do it in a predictable way which will not change over time? </li> <li> If the cache location is already filled up with some data, the incoming request from CPU has to identify whether the address from which it requires the data is same as the address whose data is stored in that location. </li> </ol> In our simple example, we can redirect by a simple logic. Given that we have to map 40 main memory locations numbered serially from 0 to 39 to 10 cache locations numbered 0 to 9, the cache location for a memory location <code>n</code> can be <code>n%10</code>. So 21 corresponds to 1, 37 corresponds to 7, etc. That becomes the index. But 37, 17, 7 all correspond to 7. So to differentiate between them, comes the tag. So just like index is <code>n%10</code>, tag is <code>int(n/10)</code>. So now 37, 17, 7 will have the same index 7, but different tags like 3, 1, 0, etc. That is, the mapping can be completely specified by the two data - tag and index. So now if a request comes for address location 29, that will translate to a tag of 2 and index of 9. Index corresponds to cache location number, so cache location no. 9 will be queried to see if it contains any data, and if so, if the associated tag is 2. If yes, it's a CPU hit and the data will be fetched from that location immediately. If it is empty, or the tag is not 2, it means that it contains the data corresponding to some other memory address and not 29 (although it will have the same index, which means it contains a data from address like 9, 19, 39, etc.). So it is a CPU miss, and data from location no. 29 in main memory will have to be loaded into the cache at location 9 (and the tag changed to 2, and deleting any data which was there before), after which it will be fetched by CPU.

How does direct mapped cache work?

1 Answers

Okay. So let's first understand how the CPU interacts with the cache.

There are three layers of memory (broadly speaking) - cache (generally made of SRAM chips), main memory (generally made of DRAM chips), and storage (generally magnetic, like hard disks). Whenever CPU needs any data from some particular location, it first searches the cache to see if it is there. Cache memory lies closest to the CPU in terms of memory hierarchy, hence its access time is the least (and cost is the highest), so if the data CPU is looking for can be found there, it constitutes a 'hit', and data is obtained from there for use by CPU. If it is not there, then the data has to be moved from the main memory to the cache before it can be accessed by the CPU (CPU generally interacts only with the cache), that incurs a time penalty.

So to find out whether the data is there or not in the cache, various algorithms are applied. One is this direct mapped cache method. For simplicity, let's assume a memory system where there are 10 cache memory locations available (numbered 0 to 9), and 40 main memory locations available (numbered 0 to 39). This picture sums it up:

enter image description here

There are 40 main memory locations available, but only upto 10 can be accommodated in the cache. So now, by some means, the incoming request from CPU needs to be redirected to a cache location. That has two problems:

How to redirect? Specifically, how to do it in a predictable way which will not change over time?
If the cache location is already filled up with some data, the incoming request from CPU has to identify whether the address from which it requires the data is same as the address whose data is stored in that location.

In our simple example, we can redirect by a simple logic. Given that we have to map 40 main memory locations numbered serially from 0 to 39 to 10 cache locations numbered 0 to 9, the cache location for a memory location n can be n%10. So 21 corresponds to 1, 37 corresponds to 7, etc. That becomes the index.

But 37, 17, 7 all correspond to 7. So to differentiate between them, comes the tag. So just like index is n%10, tag is int(n/10). So now 37, 17, 7 will have the same index 7, but different tags like 3, 1, 0, etc. That is, the mapping can be completely specified by the two data - tag and index.

So now if a request comes for address location 29, that will translate to a tag of 2 and index of 9. Index corresponds to cache location number, so cache location no. 9 will be queried to see if it contains any data, and if so, if the associated tag is 2. If yes, it's a CPU hit and the data will be fetched from that location immediately. If it is empty, or the tag is not 2, it means that it contains the data corresponding to some other memory address and not 29 (although it will have the same index, which means it contains a data from address like 9, 19, 39, etc.). So it is a CPU miss, and data from location no. 29 in main memory will have to be loaded into the cache at location 9 (and the tag changed to 2, and deleting any data which was there before), after which it will be fetched by CPU.

186

answered Sep 21 '22 04:09

SexyBeast

Related questions
                            
                                Difference between django-redis-cache and django-redis for redis caching with Django?
                            
                                How to use JPA2's @Cacheable instead of Hibernate's @Cache
                            
                                IIS and Static content?
                            
                                Is there an API to force Facebook to scrape a page again?
                            
                                is Rails.cache purged between tests?
                            
                                Is something wrong with the dynamic keyword in C# 4.0?
                            
                                Linux C++: how to profile time wasted due to cache misses?
                            
                                Stop browser to make HTTP requests for images that should stay cached - mod_expires
                            
                                How to cache Google map tiles for offline usage?
                            
                                how to clear or replace a cached image
                            
                                What happens to SDWebImage Cached Images in my app when the image file on the server changes?
                            
                                Do current x86 architectures support non-temporal loads (from "normal" memory)?
                            
                                Chrome keeps loading a old cache of my website
                            
                                How to force DNS refresh for a website?
                            
                                When does browser automatically clear cache of external JavaScript file?
                            
                                Android Volley + JSONObjectRequest Caching
                            
                                Best way to cache images on ios app?
                            
                                Make @lru_cache ignore some of the function arguments
                            
                                Spring cache @Cacheable method ignored when called from within the same class
                            
                                Caching query results in django

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How does direct mapped cache work?

Tags:

cpu-architecture

caching

system

Percentage

People also ask

1 Answers

SexyBeast

Recent Activity

Donate For Us