I am looking at the Item[TKey]
source of Dictionary<TKey,TValue>
trying to understand the mechanism of storage/retrieval in a dictionary, and why it is faster than just checking each entry one by one.
Where I get confused is in the user of prime numbers in buckets
field and the interplay with Entry<TKey,TValue>.next
.
Can someone explain to me the logic, or point to a reference where I can understand it.
Thanks.
See Hash Tables and possibly this article about using prime numbers in hash table implementations.
Look at this Wikipedia article: Hashtable
A dictionary is just a strongly-typed implementation of a hashtable. It has a number of "buckets" where it puts the items with their keys.
When you add an item to the hashtable, it uses the hashcode of the key to decide in which bucket it's going to put it (that's normally a very fast operation, just call GetHashCode
and apply a modulo to it). Once it has the bucket (which is some kind of list), it checks if the bucket already contains an item with the same key, and adds it if it's not the case. This is also pretty fast, because each bucket contains only a small subset of all the items in the hashtable.
When you want to retrieve an item based on its key, it determines the bucket based on the hashcode of the key, and looks for an item with this key in the bucket.
Of course, this is a very simplistic description... look at the Wikipedia article for more details.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With