I can't use boost::hash because I have to stick with C and can't use C++. However, I need to hash a large number (10K to 100K) of token strings (5 to 40 bytes long) so that searching among them is as fast as possible. MD5, SHA1, or any other long hash function seems too heavy for such a simple task; I am not doing cryptography. There is also the storage and computing cost. Therefore, my questions: <ol> <li>What is the simplest hash algorithm that will avoid collisions in most practical cases?</li> <li>How many bits should I use for the hash value? I am developing for 32-bit systems. Do the hash algorithms in Perl/Python use 32-bit hashes too, or do I have to jump to 64?</li> <li>Regarding the implementation of hash tables in common scripting languages: does the implementation check for collisions, or can I avoid that part altogether?</li> </ol>

A minimal hash function for C?

1 Answer

You can find a good (and fast) hash function, and an interesting read, at http://www.azillionmonkeys.com/qed/hash.html
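
For a sense of how small a non-cryptographic string hash can be, here is a minimal sketch of 32-bit FNV-1a. This is not the function from the page linked above; it is a different, well-known algorithm swapped in for illustration, and the name `fnv1a_32` is chosen here. Whether its distribution is good enough for your tokens is something to measure on your own data.

```c
#include <stddef.h>
#include <stdint.h>

/* 32-bit FNV-1a over an arbitrary byte buffer.
 * Illustrative sketch only: not the hash from the linked article. */
uint32_t fnv1a_32(const void *data, size_t len)
{
    const unsigned char *p = data;
    uint32_t h = 2166136261u;      /* FNV-1a 32-bit offset basis */

    for (size_t i = 0; i < len; i++) {
        h ^= p[i];                 /* mix in one byte */
        h *= 16777619u;            /* multiply by the 32-bit FNV prime */
    }
    return h;
}
```

A call such as `fnv1a_32(token, strlen(token))` yields a 32-bit value that can be reduced to a bucket index, for example by masking with a power-of-two table size minus one.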

The only time you can skip checking for collisions is when you use a perfect hash: a good old-fashioned lookup table, such as one generated by gperf.
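
When the token set is not known in advance, the collision check usually amounts to a string comparison inside the bucket. The sketch below shows one way to do that with separate chaining. All names (`token_set`, `token_node`, `token_hash`, `NBUCKETS`) and the bucket count are assumptions made for this example, and the embedded hash is FNV-1a rather than the function from the linked page.

```c
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

#define NBUCKETS 4096u              /* assumed size; must be a power of two */

struct token_node {
    struct token_node *next;        /* next entry in the same bucket */
    uint32_t hash;                  /* cached full hash of key */
    char *key;                      /* owned copy of the token */
};

struct token_set {
    struct token_node *buckets[NBUCKETS];   /* zero-initialise before use */
};

/* FNV-1a over a NUL-terminated string (illustrative choice of hash). */
static uint32_t token_hash(const char *s)
{
    uint32_t h = 2166136261u;
    while (*s) {
        h ^= (unsigned char)*s++;
        h *= 16777619u;
    }
    return h;
}

/* Returns 1 if key is present, 0 otherwise. */
int token_set_contains(const struct token_set *set, const char *key)
{
    uint32_t h = token_hash(key);
    const struct token_node *n = set->buckets[h & (NBUCKETS - 1)];

    for (; n != NULL; n = n->next) {
        /* Equal hashes do not prove equal strings: always compare keys. */
        if (n->hash == h && strcmp(n->key, key) == 0)
            return 1;
    }
    return 0;
}

/* Returns 1 if inserted, 0 if already present, -1 on allocation failure. */
int token_set_insert(struct token_set *set, const char *key)
{
    if (token_set_contains(set, key))
        return 0;

    size_t len = strlen(key) + 1;
    struct token_node *n = malloc(sizeof *n);
    if (n == NULL)
        return -1;
    n->key = malloc(len);
    if (n->key == NULL) {
        free(n);
        return -1;
    }
    memcpy(n->key, key, len);
    n->hash = token_hash(key);

    size_t b = n->hash & (NBUCKETS - 1);
    n->next = set->buckets[b];      /* push onto the front of the chain */
    set->buckets[b] = n;
    return 1;
}
```

Caching the full 32-bit hash in each node lets most non-matching entries be rejected with an integer comparison before `strcmp` runs, so the collision check stays cheap for short tokens.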

163

answered Sep 27 '22 17:09

gnud



Tags:

c

hashtable

hash
