Efficient implementation of a Bloom filter in C?

Tags:

bloom-filter

This question has been asked previously but there was no answer for it at that time so I decided to ask it again.

I need an efficient implementation of a Bloom filter in C (not C++). If there is no such thing available, I would not mind implementing one, if given some good reference so that it doesn't take too much of my time.

I want to use this data structure for inserts and tests in a ratio (1:20k), so primarily it is test-intensive. The data to be tested is 64 bit integers.

705

asked Jun 13 '12 02:06

Aman Deep Gautam

2 Answers

I have a stand-alone plain C library here which may be of use: https://github.com/jvirkki/libbloom

139

answered Oct 04 '22 23:10

Jyri J. Virkki

Not to do too much self-promotion, but I've written a plugin for the Geany editor/IDE that filters out duplicate text lines, it uses a Bloom filter.

The implementation is in C, and you can find it right here on GitHub. It's GPL v3, so depending on your exact needs you may or may not be able to use it.

Some notes about my implementation:

It's designed to filter strings, and doesn't abstract the key type. This means you're going to have to modify the key handling to suit your needs.
It supports un-characteristic semantics, you can actually use it for totally non-probabilistic existance-testing if you want to (see the BloomContains callback function pointer used by bloom_filter_new()). Just pass NULL to get a "pure" filter.
The string hash function is MurmurHash2 by Austin Appleby. I evaluated the more current MurmurHash3, but version 2 was easier to work with.
To fit in the Geany eco system, this code uses GLib types throughout.

It hasn't been heavily tuned for performance, but should be okay. I would appreciate any feedback you might have after testing it, of course!

answered Oct 05 '22 00:10

unwind

Related questions
                            
                                Get the PID of the process started by CreateProcess()
                            
                                Accessing specific memory locations in C
                            
                                Lightweight GNU readline alternative
                            
                                Difference between Linux errno 23 and Linux errno 24
                            
                                Why is there no "recalloc" in the C standard?
                            
                                Why does the executable binary file contain paths of included header files?
                            
                                Why does the Linux kernel #define a symbol as itself?
                            
                                Writing cross-platform apps in C
                            
                                Globally override malloc in visual c++
                            
                                Decoding a JPEG Huffman block (table)
                            
                                Extract the fields of a C struct
                            
                                How to allow certain threads to have priority in locking a mutex use PTHREADS
                            
                                How to set up the Eclipse for remote C debugging with gdbserver?
                            
                                K&R style function definition problem
                            
                                Why are function declaration mandatory in C++ and not in C?
                            
                                Calling Cocoa APIs from C
                            
                                How to implement a good debug/logging feature in a project
                            
                                Is it safe to call pthread_cancel() on terminated thread?
                            
                                Turn on core/crash dumps programmatically
                            
                                How to set decode pixel format in libavcodec?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With