Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Python - Element Tree is removing the XML declaration

I'm writing some XML with element tree.

I'm giving the code an empty template file that starts with the XML declaration:<?xml version= "1.0"?> when ET has finished making its changes and writes the completed XML its stripping out the declarion and starting with the root tag. How can I stop this?

Write call:

ET.ElementTree(root).write(noteFile)

like image 787
Jay Gattuso Avatar asked Sep 27 '12 01:09

Jay Gattuso


2 Answers

According to the documentation:

write(file, encoding="us-ascii", xml_declaration=None, method="xml")

Writes the element tree to a file, as XML. file is a file name, or a file object opened for writing. encoding 1 is the output encoding (default is US-ASCII). xml_declaration controls if an XML declaration should be added to the file. Use False for never, True for always, None for only if not US-ASCII or UTF-8 (default is None). method is either "xml", "html" or "text" (default is "xml"). Returns an encoded string.

So, write(noteFile) is explicitly telling it to write an XML declaration only if the encoding is not US-ASCII or UTF-8, and that the encoding is US-ASCII; therefore, you get no declaration.

I'm guessing if you didn't read this much, your next question is going to be "why is my Unicode broken", so let's fix both at once:

ET.ElementTree(root).write(noteFile, encoding="utf-8", xml_declaration=True)
like image 85
abarnert Avatar answered Sep 27 '22 16:09

abarnert


There are different versions of ElementTree. Some of them accept the xml_declaration argument, some do not.

The one I happen to have does not. It emits the declaration if and only if encoding != 'utf-8'. So, to get the declaration, I call write(filename, encoding='UTF-8').

like image 24
Olli Avatar answered Sep 27 '22 18:09

Olli