What's the most efficient bit vector compression method for my use case?

Tags:

I'm working on a project in computational biology and I need to store an index of locuses that differ between many sequences. For now, I'm using a B+Tree for that purpose, but I guess using a bitmap index would be way faster for such a use case : only a small number of locus differ between two sequences, 1% on average, and they are nearly equally distributed along the sequence; so it seems like there is a lot of room for bitmap index compression. My problem is that I cannot manage to find a compression method that can efficiently:

allow fast individual bit setting/unsetting
permit efficient range queries over the bitmap
possibly allow fast XOR-ing/AND-ing of two indexes

Thx in advance for your suggestions.

782

asked Jan 22 '11 15:01

fokenrute

1 Answers

Check out FastBit:

https://sdm.lbl.gov/fastbit/

answered Sep 20 '22 14:09

Noah Watkins

Related questions
                            
                                LLVM struct array iteration
                            
                                What is the memory attribute 'p' in my linker file?
                            
                                Mismatch between sizeof in ctypes.struct (packed) and packed struct in C
                            
                                C/Linux - Server <-> Terminal communication with named pipes
                            
                                Selecting a random node from a list in C
                            
                                Executing shellcode in shared memory with mmap [duplicate]
                            
                                Processing group policy with GP Extension
                            
                                Delay load python DLL when embedding python+numpy
                            
                                Launch GstRTSPServer from GstElement pipeline
                            
                                Why is this call to a pure function with a string literal argument not optimized to a constant value?
                            
                                What is the most conventional way to integrate C code into a Python library using distutils?
                            
                                Simple USB host stack
                            
                                Project Organization in C Best Practices
                            
                                Unit Testing - How to go about it?
                            
                                Are the benefits of SFIO over STDIO still valid?
                            
                                How does malloc_info() work?
                            
                                Do mmap/mprotect-readonly zero pages count towards committed memory?
                            
                                Finding all adjacent elements in a 2D array
                            
                                Asynchronous io in c using windows API: which method to use and why does my code execute synchronous?
                            
                                QEMU Crashes When Loading Kernel

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What's the most efficient bit vector compression method for my use case?

Tags:

c

indexing

bit-manipulation

bitmap

compression

fokenrute

People also ask

1 Answers

Noah Watkins

Recent Activity

Donate For Us