compressed string storage

Tags:

Lets say I have many objects containing strings of non-trivial length (around ~3-4kb). The strings are all different from each other yet at the same time contain lots of common parts/subsequences. On average maybe 80-90% of any individual string is contained withing the others as well. Is there an easy way to automatically exploit this huge redundancy for compressing the data?
Ideally the solution would be C++ and transparent for the user (i.e. I can use it as if I was accessing a regular read only const std::string but instead reading from compressed storage).

616

asked Dec 03 '10 09:12

BuschnicK

1 Answers

Algorithmically, Lempel–Ziv–Welch with one dictionary for all objects/strings might be a good start.

100

answered Sep 20 '22 20:09

NPE

Related questions
                            
                                Compiling Quantlib via SWIG for C#
                            
                                C/C++ - posix_memalign()
                            
                                How does boost::ptr_vector deep copy the underlying objects?
                            
                                How to detect header changes in make dependency list
                            
                                Thread related issues and debugging them
                            
                                Difference between double comparisons in gtest (C++) and nunit (C#)
                            
                                C++ and Java objects communication
                            
                                Arguments for and against supporting std::wstring exclusively in cross-platform library
                            
                                Unqualified lookup and (maybe-)dependent base classes
                            
                                Qt: Best way to implement "oscilloscope-like" realtime-plotting
                            
                                How do I read a FIFO/named pipe line by line from a C++/Qt Linux app?
                            
                                Assigning a depth to each node
                            
                                Sending a C++ struct over UDP in Java
                            
                                How to get all properties/variables of a class at runtime/dynamically in C++
                            
                                How does stringstream work internally?
                            
                                Finding the angles for the X, Y and Z axis in 3D - OpenGL/C++
                            
                                C++ equivalent of C# 4.0's "dynamic" keyword?
                            
                                memory management & std::allocator
                            
                                Convert 128-bit hexadecimal string to base-36 string
                            
                                Boost tuple performance

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

compressed string storage

Tags:

c++

string

algorithm

data-structures

compression

BuschnicK

People also ask

1 Answers

NPE

Recent Activity

Donate For Us