Convert Python ElementTree to string

How do I convert `ElementTree.Element` to a String?

For Python 3:

xml_str = ElementTree.tostring(xml, encoding='unicode')

For Python 2:

xml_str = ElementTree.tostring(xml, encoding='utf-8')

The following is compatible with both Python 2 & 3, but only works for Latin characters:

xml_str = ElementTree.tostring(xml).decode()

Example usage

from xml.etree import ElementTree

xml = ElementTree.Element("Person", Name="John")
xml_str = ElementTree.tostring(xml).decode()
print(xml_str)

Output:

<Person Name="John" />

Explanation

Despite what the name implies, ElementTree.tostring() returns a bytestring by default in Python 2 & 3. This is an issue in Python 3, which uses Unicode for strings.

In Python 2 you could use the str type for both text and binary data. Unfortunately this confluence of two different concepts could lead to brittle code which sometimes worked for either kind of data, sometimes not. [...]

To make the distinction between text and binary data clearer and more pronounced, [Python 3] made text and binary data distinct types that cannot blindly be mixed together.

^{Source: Porting Python 2 Code to Python 3}

If we know what version of Python is being used, we can specify the encoding as unicode or utf-8. Otherwise, if we need compatibility with both Python 2 & 3, we can use decode() to convert into the correct type.

For reference, I've included a comparison of .tostring() results between Python 2 and Python 3.

ElementTree.tostring(xml)
# Python 3: b'<Person Name="John" />'
# Python 2: <Person Name="John" />

ElementTree.tostring(xml, encoding='unicode')
# Python 3: <Person Name="John" />
# Python 2: LookupError: unknown encoding: unicode

ElementTree.tostring(xml, encoding='utf-8')
# Python 3: b'<Person Name="John" />'
# Python 2: <Person Name="John" />

ElementTree.tostring(xml).decode()
# Python 3: <Person Name="John" />
# Python 2: <Person Name="John" />

Thanks to Martijn Peters for pointing out that the str datatype changed between Python 2 and 3.

Why not use str()?

In most scenarios, using str() would be the "cannonical" way to convert an object to a string. Unfortunately, using this with Element returns the object's location in memory as a hexstring, rather than a string representation of the object's data.

from xml.etree import ElementTree

xml = ElementTree.Element("Person", Name="John")
print(str(xml))  # <Element 'Person' at 0x00497A80>

Non-Latin Answer Extension

Extension to @Stevoisiak's answer and dealing with non-Latin characters. Only one way will display the non-Latin characters to you. The one method is different on both Python 3 and Python 2.

Input

xml = ElementTree.fromstring('<Person Name="크리스" />')
xml = ElementTree.Element("Person", Name="크리스")  # Read Note about Python 2

NOTE: In Python 2, when calling the toString(...) code, assigning xml with ElementTree.Element("Person", Name="크리스")will raise an error...

UnicodeDecodeError: 'ascii' codec can't decode byte 0xed in position 0: ordinal not in range(128)

Output

ElementTree.tostring(xml)
# Python 3 (크리스): b'<Person Name="&#53356;&#47532;&#49828;" />'
# Python 3 (John): b'<Person Name="John" />'

# Python 2 (크리스): <Person Name="&#53356;&#47532;&#49828;" />
# Python 2 (John): <Person Name="John" />


ElementTree.tostring(xml, encoding='unicode')
# Python 3 (크리스): <Person Name="크리스" />             <-------- Python 3
# Python 3 (John): <Person Name="John" />

# Python 2 (크리스): LookupError: unknown encoding: unicode
# Python 2 (John): LookupError: unknown encoding: unicode

ElementTree.tostring(xml, encoding='utf-8')
# Python 3 (크리스): b'<Person Name="\xed\x81\xac\xeb\xa6\xac\xec\x8a\xa4" />'
# Python 3 (John): b'<Person Name="John" />'

# Python 2 (크리스): <Person Name="크리스" />             <-------- Python 2
# Python 2 (John): <Person Name="John" />

ElementTree.tostring(xml).decode()
# Python 3 (크리스): <Person Name="&#53356;&#47532;&#49828;" />
# Python 3 (John): <Person Name="John" />

# Python 2 (크리스): <Person Name="&#53356;&#47532;&#49828;" />
# Python 2 (John): <Person Name="John" />

Related questions
                            
                                How to change backends in matplotlib / Python
                            
                                Which is the best way to allow configuration options be overridden at the command line in Python?
                            
                                Multi Index Sorting in Pandas
                            
                                How do I make pyCharm stop hiding (unfold) my Python imports?
                            
                                Is there a multi-dimensional version of arange/linspace in numpy?
                            
                                What is the difference between np.mean and tf.reduce_mean?
                            
                                Difference between data type 'datetime64[ns]' and '<M8[ns]'?
                            
                                Getting "global name 'foo' is not defined" with Python's timeit
                            
                                Get first element of Series without knowing the index [duplicate]
                            
                                How to get char from string by index?
                            
                                How can I open an Excel file in Python?
                            
                                In Python, what does dict.pop(a,b) mean?
                            
                                psycopg2: AttributeError: 'module' object has no attribute 'extras'
                            
                                matplotlib.pyplot will not forget previous plots - how can I flush/refresh?
                            
                                Create Empty Dataframe in Pandas specifying column types
                            
                                How do you composite an image onto another image with PIL in Python?
                            
                                Replace invalid values with None in Pandas DataFrame
                            
                                memory-efficient built-in SqlAlchemy iterator/generator?
                            
                                Access index of last element in data frame
                            
                                Generic catch for python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Convert Python ElementTree to string

Tags:

python

xml

marshalling

elementtree

People also ask

How do I convert `ElementTree.Element` to a String?

Example usage

Explanation

Why not use str()?

Non-Latin Answer Extension

Recent Activity

Donate For Us

Convert Python ElementTree to string

Tags:

python

xml

marshalling

elementtree

People also ask

How do I convert ElementTree.Element to a String?

Example usage

Explanation

Why not use str()?

Non-Latin Answer Extension

Related questions

Recent Activity

Donate For Us

How do I convert `ElementTree.Element` to a String?