Best compression algorithm for XML?

Tags:

I barely know a thing about compression, so bear with me (this is probably a stupid and painfully obvious question).

So lets say I have an XML file with a few tags.

<verylongtagnumberone>   <verylongtagnumbertwo>     text   </verylongtagnumbertwo> </verylongtagnumberone>

Now lets say I have a bunch of these very long tags with many attributes in my multiple XML files. I need to compress them to the smallest size possible. The best way would be to use an XML-specific algorithm which assigns individual tags pseudonyms like vlt1 or vlt2. However, this wouldn't be as 'open' of a way as I m trying to go for, and I want to use a common algorithm like DEFLATE or LZ. It also helpes if the archive was a .zip file.

Since I'm dealing with plain text (no binary files like images), I'd like an algorithm that suits plain text. Which one produces the smallest file size (lossless algorithms are preferred)?

By the way, the scenario is this: I am creating a standard for documents, like ODF or MS Office XML, that contain XML files, packaged in a .zip.

EDIT: The 'encryption' thing was a typo; it should ave ben 'compression'.

871

asked Jul 04 '09 14:07

Aethex

1 Answers

There is a W3 (not-yet-released) standard named EXI (Efficient XML Interchange).

Should become THE data format for compressing XML data in the future (claimed to be the last necessary binary format). Being optimized for XML, it compresses XML more ways more efficient than any conventional compression algorithm.

With EXI, you can operate on compressed XML data on the fly (without the need to uncompress or re-compress it).

EXI = (XML + XMLSchema) as binary.

And here you go with the opensource implementation (don't know if it's already stable):
Exificient

108

answered Sep 19 '22 21:09

ivan_ivanovich_ivanoff

Related questions
                            
                                Validating XML with XSDs ... but still allow extensibility
                            
                                Serialize a C# class to XML with attributes and a single value for the class
                            
                                What is the fastest way to combine two xml files into one
                            
                                XML file encoding format "utf-8" VS "UTF-8"?
                            
                                Add multiple custom views to layout programmatically
                            
                                How to preserve an ampersand (&) while using FOR XML PATH on SQL 2005
                            
                                Android Custom View Constructor
                            
                                Loop through all elements in XML using NodeList
                            
                                Scroll behavior in nested RecyclerView with horizontal scroll
                            
                                How to Read XML in .NET?
                            
                                Required Multiple beans of same type in Spring
                            
                                How to add shadow around circular imageview
                            
                                When and Why is XML preferable to CSV? [closed]
                            
                                How to change direction of android elevation shadow?
                            
                                Any experiences with Protocol Buffers?
                            
                                use xsl to output plain text
                            
                                Serializing Lists of Classes to XML
                            
                                Implementing a custom Decoder in Swift 4
                            
                                Tools for debugging xslt
                            
                                What are the C# documentation tags? [closed]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Best compression algorithm for XML?

Tags:

text

algorithm

xml

compression

zip

Aethex

People also ask

1 Answers

ivan_ivanovich_ivanoff

Recent Activity

Donate For Us