Cache invalidation strategy

Tags:

caching

redis

In my current application, we are dealing with some information that rarely changes.
For performance optimization, we want to store it in the cache.
The problem is invalidating these objects in the cache whenever they are updated.
We have not finalized the caching product.
As we are building this application on Azure, we will probably use Azure Redis Cache.
One strategy could be to add code to the Update API that invalidates the object in the cache.
I am not sure whether this is a clean approach.
We do not want to go with time-based cache expiration (TTL).
Could you please suggest some other strategies used for cache invalidation?

asked May 11 '15 by Pragmatic


People also ask

What is a cache invalidation mechanism?

Cache invalidation refers to the process by which web cache proxies declare cached content invalid, meaning it will no longer be served as the most current piece of content when it is requested. Several invalidation methods are possible, including purging, refreshing and banning.

What is the best caching strategy?

Cache-aside (lazy loading) is the most common caching strategy. The fundamental data retrieval logic can be summarized as follows: when your application needs to read data, it checks the cache first to determine whether the data is available; on a miss, it reads the database and populates the cache.
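
As a rough illustration only, here is what that read path might look like in Python with the redis-py client; the user:<id> key scheme and the load_user_from_db helper are made-up placeholders, not something from the question:

    import json

    import redis

    # Hypothetical connection details, purely for illustration.
    r = redis.Redis(host="localhost", port=6379, decode_responses=True)

    def load_user_from_db(user_id):
        """Hypothetical lookup against the primary database; replace with real data access."""
        raise NotImplementedError

    def get_user(user_id):
        """Cache-aside read: try the cache first, fall back to the database on a miss."""
        key = f"user:{user_id}"
        cached = r.get(key)
        if cached is not None:
            return json.loads(cached)        # cache hit
        user = load_user_from_db(user_id)    # cache miss: read the source of truth
        r.set(key, json.dumps(user))         # populate the cache for the next reader
        return user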

What happens when cache is invalidated?

Cache invalidation describes the process of actively invalidating stale cache entries when data in the source of truth mutates. If a cache invalidation gets mishandled, it can indefinitely leave inconsistent values in the cache that are different from what's in the source of truth.


1 Answer

Invalidating the cache during the update stage is a viable approach, and it was used extensively in the past.

You have two options here when the UPDATE happens:

  1. You may set the new value in the cache during the update operation, or
  2. Just delete the old one and write it back to the cache during a later read operation.

If you want an LRU cache, the UPDATE may just delete the old value, and the first time the object is fetched again you recreate it from the actual database after the read. However, if you know that your cache is small and you are using a separate main database for reasons other than data size, you may update the cached value directly during the UPDATE.
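
A minimal sketch of the two options, again in Python with redis-py (the save_user_to_db helper and the user:<id> key scheme are hypothetical placeholders):

    import json

    import redis

    r = redis.Redis(host="localhost", port=6379, decode_responses=True)

    def save_user_to_db(user_id, user):
        """Hypothetical write to the primary database; replace with real update code."""
        raise NotImplementedError

    # Option 1: set the new value in the cache as part of the update (write-through style).
    def update_user_write_through(user_id, user):
        save_user_to_db(user_id, user)               # write the source of truth first
        r.set(f"user:{user_id}", json.dumps(user))   # then refresh the cached copy

    # Option 2: just delete the cached copy; the next read repopulates it lazily.
    def update_user_delete(user_id, user):
        save_user_to_db(user_id, user)
        r.delete(f"user:{user_id}")                  # stale entry is gone until the next read

Either way, writing the database first and touching the cache second keeps the source of truth authoritative even if the cache operation fails.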

However, all this is not enough to be completely consistent.
When you write to your DB, the Redis cache may be unavailable for a few seconds, for example, so the data remains out of sync between the two.
What do you do in that case?
There are several options you could use at the same time.

  1. Set a TTL anyway, so that eventually broken data is refreshed.
  2. Use lazy read repair: when you read from the cache, from time to time check against the primary database whether the value still matches, and if not, update (or delete) the cached item.
  3. Use epochs or similar versioning to access your data. This is not always possible; however, when you cache data about a given object, you may change the object's ID/handle every time you modify it, so that it is impossible to access stale data in the cache: every key name refers to a specific version of your object (a minimal sketch follows this list).
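
For option 3, a minimal sketch of versioned (epoch-style) keys with redis-py; the user:<id>:ver counter and the helper names are assumptions for illustration:

    import json

    import redis

    r = redis.Redis(host="localhost", port=6379, decode_responses=True)

    def cache_key(user_id):
        # The key name embeds a per-object version counter, so after an update the
        # old key is simply never looked up again and cannot serve stale data.
        version = r.get(f"user:{user_id}:ver") or "0"
        return f"user:{user_id}:v{version}"

    def on_update(user_id, user):
        # ... write `user` to the primary database here ...
        r.incr(f"user:{user_id}:ver")                          # readers switch to a new key name
        r.set(cache_key(user_id), json.dumps(user), ex=3600)   # old versions are reclaimed by TTL

    def on_read(user_id):
        cached = r.get(cache_key(user_id))                     # always reads the current version's key
        if cached is not None:
            return json.loads(cached)
        # ... on a miss, read from the primary database and cache it under the current key ...
        return None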

So del-cache-on-update plus write-cache-on-read is the basic strategy, but you can employ additional mechanisms to eventually repair the inconsistencies.

There is actually another option, instead of the ones above: have a background process that uses Redis SCAN to verify, key by key, whether there are inconsistencies. This process can be slow and can run against a replica of your database.
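
A rough sketch of such a background checker using redis-py's SCAN iterator; the user:* key pattern and the load_user_from_db helper are hypothetical:

    import json

    import redis

    r = redis.Redis(host="localhost", port=6379, decode_responses=True)

    def load_user_from_db(user_id):
        """Hypothetical lookup against a replica of the primary database."""
        raise NotImplementedError

    def verify_cache():
        # SCAN walks the keyspace incrementally without blocking the server (unlike KEYS).
        for key in r.scan_iter(match="user:*", count=100):
            user_id = key.split(":")[1]
            cached = r.get(key)
            authoritative = load_user_from_db(user_id)
            if cached is None or json.loads(cached) != authoritative:
                r.set(key, json.dumps(authoritative))   # repair the inconsistent entry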

As you can see, the main idea here is always the same: if an update to the cache fails, don't let it become a permanent issue that remains there potentially forever; give it a chance to fix itself at a later time.

answered Sep 28 '22 by antirez