lxml offers a few different functions to parse strings. Two of them, `etree.fromstring()` and `etree.XML()`, seem very similar. The docstring for the former says it's for parsing "strings", while the latter is for "string constants". Additionally, `XML()`'s docstring states:

> This function can be used to embed "XML literals" in Python code, [...]

What's the functional difference between these functions? When should one be used over the other?
Looking at the source code for `XML()` and `fromstring()`, the former has this extra snippet of code:

```python
if parser is None:
    parser = __GLOBAL_PARSER_CONTEXT.getDefaultParser()
    if not isinstance(parser, XMLParser):
        parser = __DEFAULT_XML_PARSER
```
They thus differ in how they select the default parser: if the global default parser has been set to something that is not an `XMLParser`, `XML()` will ignore it and fall back to the default XML parser, whereas `fromstring()` will use whatever the global default is.
```python
from lxml import etree

etree.set_default_parser(etree.HTMLParser())

etree.tostring(etree.fromstring("<root/>"))
# b'<html><body><root/></body></html>'

etree.tostring(etree.XML("<root/>"))
# b'<root/>'
```
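The difference only matters when no parser is passed explicitly. As a minimal sketch (the `remove_blank_text` option is just a stand-in to show that a custom parser is being honoured), supplying the same `XMLParser` to both functions should make them behave identically:

```python
from lxml import etree

# An explicit parser overrides any default-parser logic in both functions.
parser = etree.XMLParser(remove_blank_text=True)

a = etree.fromstring("<root>  <child/>  </root>", parser)
b = etree.XML("<root>  <child/>  </root>", parser)

# Both parses drop the ignorable whitespace and produce the same tree.
assert etree.tostring(a) == etree.tostring(b)
print(etree.tostring(a))  # b'<root><child/></root>'
```

So in practice: if you might ever change the global default parser (e.g. to an `HTMLParser`) and still want guaranteed XML semantics, use `XML()`; otherwise the two are interchangeable.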