This is a question I ran into in a school setting, but it keeps bugging me, so I decided to ask it here.
In Huffman compression, fixed-length sequences (characters) are encoded with variable-length sequences. The code sequence length depends on the frequencies (or probabilities) of the source characters.
My question is: what is the minimum frequency of the most frequent character that guarantees it will be encoded with a single bit?
Huffman coding is an example of a variable-length encoding: some characters may require only 2 or 3 bits, while others may require 7, 10, or 12 bits.
Huffman coding is a lossless data compression algorithm. The idea is to assign variable-length codes to input characters, where the lengths of the assigned codes are based on the frequencies of the corresponding characters. The most frequent character gets the shortest code and the least frequent character gets the longest code.
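To make the construction concrete, here is a minimal Python sketch of the standard greedy algorithm (a hypothetical helper, not part of the original question): it repeatedly merges the two least frequent subtrees with a min-heap and then reads the codewords off the resulting tree.

```python
# Minimal Huffman-construction sketch (hypothetical helper, not from the original
# post). Frequencies are assumed to be a dict mapping symbol -> weight; the weights
# only need to be comparable and addable (counts or probabilities both work).
import heapq
from itertools import count

def huffman_codes(freqs):
    """Return {symbol: bitstring} built by greedily merging the two lightest subtrees."""
    if len(freqs) == 1:                      # degenerate case: one symbol, one-bit code
        return {next(iter(freqs)): "0"}
    tiebreak = count()                       # keeps the heap from comparing tree nodes
    heap = [(w, next(tiebreak), (sym, None, None)) for sym, w in freqs.items()]
    heapq.heapify(heap)
    while len(heap) > 1:
        w1, _, left = heapq.heappop(heap)    # two least frequent subtrees...
        w2, _, right = heapq.heappop(heap)
        heapq.heappush(heap, (w1 + w2, next(tiebreak), (None, left, right)))  # ...merged
    codes = {}
    def walk(node, prefix):
        sym, left, right = node
        if sym is not None:
            codes[sym] = prefix
        else:
            walk(left, prefix + "0")
            walk(right, prefix + "1")
    walk(heap[0][2], "")
    return codes

print(huffman_codes({"a": 5, "b": 2, "c": 1, "d": 1, "e": 1}))
# "a" (frequency 5/10 = 0.5) ends up with a 1-bit codeword.
```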
It turns out that the answer is 0.4: if the highest frequency p satisfies p >= 0.4, a 1-bit code for the corresponding character is guaranteed. In other words, p >= 0.4 is a sufficient condition.
It is also true that p >= 1/3 is a necessary condition. That is, there can be examples with 0.4 > p >= 1/3 where the most frequent character still gets a 1-bit code, but that never happens when p < 1/3.
One way to reason about this is to look at how the code tree is constructed, in particular at the frequencies of the last three surviving subtrees. A proof appears in Johnsen, "On the redundancy of binary Huffman codes", 1980 (unfortunately behind a paywall).
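As a rough sanity check of those thresholds (not a proof), here are two small distributions, with example numbers chosen for illustration rather than taken from the original answer, assuming the huffman_codes sketch from above is available:

```python
# Two small distributions probing the thresholds (example numbers chosen here, not
# from the original answer), reusing the hypothetical huffman_codes sketch above.
above = {"a": 0.45, "b": 0.25, "c": 0.15, "d": 0.15}   # p = 0.45 >= 0.4
below = {"a": 0.32, "b": 0.23, "c": 0.23, "d": 0.22}   # p = 0.32 < 1/3

print(len(huffman_codes(above)["a"]))  # 1 -- a 1-bit code is guaranteed
print(len(huffman_codes(below)["a"]))  # 2 -- the merges push "a" below the root
```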