Hash Function For Sequence of Unique Ids (UUID)

Tags:

I am storing message sequences in the database each sequence can have up to N number of messages. I want to create a hash function which will represent the message sequence and enable to check faster if message sequence exists.

Each message has a case-sensitive alphanumeric universal unique id (UUID). Consider following messages (M1, M2, M3) with ids-

M1 - a3RA0000000e0taBB M2 - a3RA00033000e0taC M3 - a3RA0787600e0taBB

Message sequences can be

Sequence-1 : (M1,M2,M3) Sequence-2 : (M1,M3,M2) Sequence-3 : (M2,M1,M3) Sequence-4 : (M1,M2) Sequence-5 : (M2,M3) ...etc...

Following is the database structure example for storing message sequence

enter image description here

Given the message sequence, we need to check whether that message sequence exists in the database. For example, check if message sequence M1 -> M2 -> M3 i.e. with UIDs (a3RA0000000e0taBB -> a3RA00033000e0taC -> a3RA0787600e0taBB) exists in the database.

Instead of scanning the rows in the table, I want to create a hash function which represents the message sequence with a hash value. Using the hash value lookup in the table supposedly faster.

My simple hash function is- enter image description here

I am wondering what would be an optimal hash function for storing the message sequence hash for faster is exists check.

883

asked Aug 20 '18 22:08

Anwar Shaikh

1 Answers

You don't need a full-blown cryptographic hash, just a fast one, so how about having a look at FastHash: https://github.com/ZilongTan/Coding/tree/master/fast-hash. If you believe 32 or 64 bit hashes are not enough (i.e. produce too many collisions) then you could use the longer MurmurHash: https://en.wikipedia.org/wiki/MurmurHash (actually, the author of FastHash recommends this approach)

There's a list of more algorithms on Wikipedia: https://en.wikipedia.org/wiki/List_of_hash_functions#Non-cryptographic_hash_functions

In any case, hashes using bit operations (SHIFT, XOR ...) should be faster than the multiplication in your approach, even on modern machines.

answered Oct 09 '22 10:10

memo

Related questions
                            
                                Choose rectangles with maximal intersection area
                            
                                Interview challenge: Find the different elements in two arrays
                            
                                Fast method to check if a Matrix is singular? (non-invertible, det = 0)
                            
                                multi-way merge vs 2-way merge
                            
                                Sorting a point array efficiently in C?
                            
                                Implementing first fit like algorithm
                            
                                Merge ranges in intervals
                            
                                how to search for a given word from a huge database?
                            
                                Find simplest regular expression matching all given strings
                            
                                Image sharpness metric
                            
                                Knapsack: how to add item type to existing solution
                            
                                Offset/limit to page/size conversion
                            
                                Lexicographic minimum permutation such that all adjacent letters are distinct
                            
                                How to efficiently determine if a set of points contains two that are close
                            
                                Efficiently finding the largest surrounding square in 2D grid
                            
                                Get a sublist from an ArrayList efficiently
                            
                                Reordering a list to maximize difference of adjacent elements
                            
                                Bit operation used in a for loop
                            
                                Location based horizontal scalable dating app database model
                            
                                Optimized argmin: an effective way to find an item minimizing a function

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Hash Function For Sequence of Unique Ids (UUID)

Tags:

algorithm

data-structures

hash

hash-function

Anwar Shaikh

People also ask

1 Answers

memo

Recent Activity

Donate For Us