Fixed point on a compression algorithm widely used nowadays

Tags:

I was wondering if there is a compression algorithm, in common use today, that contains a fixed point, i.e., an identity file.

To explain, let's call C : byte[] -> byte[] a function that represents the compression algorithm. I want to know if there exists (and what it is, if it is possible to determine in reasonable time) a file f such that

C(f) = f

That is, a file that, when compressed by a suitable, widely-known compression algorithm in common use nowadays, will produce itself as the result.

Do you know of such a phenomenon?

470

asked Aug 20 '09 12:08

Bruno Reis

2 Answers

Yes! This is a variant on the quine problem.

An example that uses gzip: http://groups.google.com/group/comp.compression/browse_thread/thread/c57c322e15c782aa/350d9fb166fdf11f
An example that uses zip/unzip: http://www.steike.com/code/useless/zip-file-quine/

107

answered Sep 24 '22 06:09

ire_and_curses

Warning: Rather pedantic answer.

There are many cases where D(f) = f (D being defined as decompression). However, compression is not as precisely defined. For most compression algorithms, different implementations of the compression algorithm will give different output files (of varying sizes). Consider two programs, 1 and 2. For full interoperability, it is necessary that D1(F) must equal D2(F) for all valid F. Similarly, it is necessary that D2(C2(f)) == D2(C1(F)) == D1(C1(F)) == D1(C2(F)), for all valid F. However it is totally unnecessary that C1(F) == C2(F), and this is in fact rarely the case.

So, you are unlikely to, if you actually compress such quines, to end up with the same file, unless you use the same program to do so that was used to generate it (which is unlikely, since such quines are usually hand-crafted, with C(F) never even being tested).

While it is possible (indeed, trivial!) to produce a program for which C(F) == F for some F, most people tend to instead point out as quines the more well-defined case where D(F) == F (since D1(F)==D2(F) for all valid, compatible decompression of the format of F, assuming F is valid).

So, there are likely cases where C(F) == F, but generally this is the wrong question to ask, and you should instead ask for cases where D(F) == F...which other people who answered the question have provided.

answered Sep 22 '22 06:09

Brian

Related questions
                            
                                Generate any number in the fewest step using multiply by 2 or divide by 3?
                            
                                Is there a pushable/poppable hash function for stack-like objects?
                            
                                Alibaba interview: print a sentence with min spaces
                            
                                What concepts or algorithms exist for parallelizing parsers?
                            
                                Optimal way to determine If it is possible to arrive at pair (c,d) when starting from (a,b)
                            
                                Trying to solve Sudoku with cvxpy
                            
                                How to efficiently find similar strings in a unique string in JavaScript?
                            
                                Divide array into sub arrays such that no sub array contains duplicate elements
                            
                                Why is my code not calculating the correct value for the expression string?
                            
                                Closure Number Method for Generate Parenthesis Problem
                            
                                Splitting an array finding minimum difference between the sum of two subarray in distributed environment
                            
                                The point that minimizes the sum of euclidean distances to a set of n points
                            
                                My algorithm for calculating the modulo of a very large fibonacci number is too slow
                            
                                Algorithm for miter joins on multiple lines
                            
                                Extracting Leaf paths from n-ary tree in F#
                            
                                Good algorithm for drawing solid 2-dimensional polygons?
                            
                                Efficient evaluation of hypergeometric functions
                            
                                Securing AJAX Requests via GUID
                            
                                Efficient way to generate random contingency tables?
                            
                                Algorithms or Patterns for reading text

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Fixed point on a compression algorithm widely used nowadays

Tags:

algorithm

compression

fixed-point

Bruno Reis

People also ask

2 Answers

ire_and_curses

Brian

Recent Activity

Donate For Us