How can I compress a sequence of integers?

Tags:

I have an array which contains data within range -255 to +255.e.g. The array can be like this:

  int data[]={234,56,-4,24,56,78,23,89,234,68,-12,-253,45,128};

Here, order must be preserved while decompressing e.g. after 1st term 234, 56 must come.

So, what are the ways to compress any arbitrary sequence of numbers for which any repeating pattern can't be observed?

995

asked Sep 01 '12 11:09

3 Answers

A range of -255 to 255 means 511 values -> 9 bits. If you take the sign separately, 1 bit for sign and a byte for value.

You can write your array as a byte array, each byte value will be the absolute value of the related int.

In a separate zone (a long, or perhaps a byte array), store the sign bit.

134

answered Nov 14 '22 07:11

SJuan76

If there are truly no patterns in the data then a useful compression algorithm is impossible. Don't even bother trying!

Of course, in this case because all the numbers are in a restricted range n then you do have a pattern in the bits - namely that your high bits are either all 0 (positive) or all 1 (negative).

Standard compression algorithms like zip would therefore work if you want to compress reasonably effectively (assuming you have a long enough array of numbers to make it worthwhile).

Alternatively you can exploit the fact that you are effectively using 9-bit numbers. So you could roll your own compression algorithm by laying out the numbers as a long stream of 9-bit chunks and putting this into a byte array.

answered Nov 14 '22 08:11

mikera

In your situation (when repeating pattern can't be observed), variable-length coding may help you.

For example, Elias gamma-coding and Exponential-Golomb coding. The general idea - is that small numbers needs only few bits to be encoded. Exp-Golomb coding is used in the H.264/MPEG-4 AVC video compression standard. It is very easy to encode and decode sequences with it, also it is not very hard to implement this coding.

The way to code all integers is to set up a bijection, mapping integers (0, 1, -1, 2, -2, 3, -3, ...) to (1, 2, 3, 4, 5, 6, 7, ...) before coding.

For example:

Sequence (after bijection) [ 0, 2, 5, 8, 5, 2 ] would be encoded as 101100110000100100110011 - As you may see - there is no repeating patterns in this sequence, but it encoded only with 24 bits.

Short description of decoding process:

Read from input stream and count leading zero-bits (until you find non-zero bit) -> zero_bits_count
Read from input stream next ( zero_bits_count + 1 ) bits -> binary
Convert binary to decimal
Return ( decimal - 1 )

1... -> no leading zeros, zero_bits_count = 0 -> read next 1 bit -> [1]... -> [1] is 1 -> 1 - 1 = 0

011... -> [0] - one leading zero, zero_bits_count = 1 -> read next 2 bits -> [11]... -> [11] is 3 -> 3 - 1 = 2

00110... -> [00] - two leading zeros, zero_bits_count = 2 -> read next 3 bits -> [110]... -> [110] is 6 -> 6 - 1 = 5

etc.

answered Nov 14 '22 09:11

stemm

Related questions
                            
                                Getting trouble in installing tag lib in Apache tomcat7
                            
                                what is the use of throws Exception
                            
                                Best way to prevent concurrent modification exception
                            
                                welcome-file in web.xml with spring not working?
                            
                                m2eclipse is unable to locate C:\Program Files\Java\jre6\..\lib\tools.jar
                            
                                Encryption of Strings works, encryption of byte[] array type does not work
                            
                                enable screen rotation android
                            
                                What is the ampersand-equals operator used for in Java?
                            
                                Java - mouseMoved() event handling in Swing
                            
                                JComboBox not showing arrow
                            
                                Java Program to create a PNG waveform for an audio file
                            
                                Using String.endswith() method on Java
                            
                                How to locate the table id in Google Analytics?
                            
                                Will adding a column externally to a table mapped by JPA break normal functioning?
                            
                                How to set default values for Preferences in Eclipse RCP application
                            
                                Changing classpath in Eclipse
                            
                                Android: Out of Memory Exception / How does decodeResource add to the VM Budget?
                            
                                Is there something like C# Task in Java?
                            
                                What does <<= operator mean in Java?
                            
                                How is it possible to cast an Android Activity to an interface?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How can I compress a sequence of integers?

Tags:

java

arrays

multidimensional-array

arraylist

compression

Debadyuti Maiti

People also ask

3 Answers

SJuan76

mikera

stemm

Recent Activity

Donate For Us