How to output CDATA using ElementTree

Tags:

I've discovered that cElementTree is about 30 times faster than xml.dom.minidom and I'm rewriting my XML encoding/decoding code. However, I need to output XML that contains CDATA sections and there doesn't seem to be a way to do that with ElementTree.

Can it be done?

280

asked Oct 06 '08 15:10

elifiner

2 Answers

After a bit of work, I found the answer myself. Looking at the ElementTree.py source code, I found there was special handling of XML comments and preprocessing instructions. What they do is create a factory function for the special element type that uses a special (non-string) tag value to differentiate it from regular elements.

def Comment(text=None):     element = Element(Comment)     element.text = text     return element

Then in the _write function of ElementTree that actually outputs the XML, there's a special case handling for comments:

if tag is Comment:     file.write("<!-- %s -->" % _escape_cdata(node.text, encoding))

In order to support CDATA sections, I create a factory function called CDATA, extended the ElementTree class and changed the _write function to handle the CDATA elements.

This still doesn't help if you want to parse an XML with CDATA sections and then output it again with the CDATA sections, but it at least allows you to create XMLs with CDATA sections programmatically, which is what I needed to do.

The implementation seems to work with both ElementTree and cElementTree.

import elementtree.ElementTree as etree #~ import cElementTree as etree  def CDATA(text=None):     element = etree.Element(CDATA)     element.text = text     return element  class ElementTreeCDATA(etree.ElementTree):     def _write(self, file, node, encoding, namespaces):         if node.tag is CDATA:             text = node.text.encode(encoding)             file.write("\n<![CDATA[%s]]>\n" % text)         else:             etree.ElementTree._write(self, file, node, encoding, namespaces)  if __name__ == "__main__":     import sys      text = """     <?xml version='1.0' encoding='utf-8'?>     <text>     This is just some sample text.     </text>     """      e = etree.Element("data")     cdata = CDATA(text)     e.append(cdata)     et = ElementTreeCDATA(e)     et.write(sys.stdout, "utf-8")

answered Sep 17 '22 15:09

elifiner

lxml has support for CDATA and API like ElementTree.

answered Sep 17 '22 15:09

iny

Related questions
                            
                                Python logging file config KeyError: 'formatters'
                            
                                obtaining last value of dataframe column without index
                            
                                Error message "python-pylint 'C0103:Invalid constant name"
                            
                                How to use Python decorators to check function arguments?
                            
                                Python vectorizing nested for loops
                            
                                Is it possible to change the model name in the django admin site?
                            
                                Slicing a list into n nearly-equal-length partitions [duplicate]
                            
                                Django and query string parameters
                            
                                Why doesn't Python evaluate constant number arithmetic before compiling to bytecode?
                            
                                Appending two dataframes with same columns, different order
                            
                                Overloading __dict__() on python class
                            
                                pandas groupby and join lists
                            
                                Reading input sound signal using Python
                            
                                How to install dependencies from a copied pipfile inside a virtual environment?
                            
                                How to exit when viewing python help like help(os.listdir)
                            
                                Function with varying number of For Loops (python)
                            
                                Saving numpy array to txt file row wise
                            
                                python regex first/shortest match
                            
                                How do I test dictionary-equality with Python's doctest-package?
                            
                                setting up s3 for logs in airflow

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to output CDATA using ElementTree

Tags:

python

xml

elifiner

People also ask

2 Answers

elifiner

iny

Recent Activity

Donate For Us