Is there a way to ignore the XML namespace in tage names in <code>elementtree.ElementTree</code>? I try to print all <code>technicalContact</code> tags: <pre class="prettyprint"><code>for item in root.getiterator(tag='{http://www.example.com}technicalContact'): print item.tag, item.text </code></pre> And I get something like: <pre class="prettyprint"><code>{http://www.example.com}technicalContact blah@example.com </code></pre> But what I really want is: <pre class="prettyprint"><code>technicalContact blah@example.com </code></pre> Is there a way to display only the suffix (sans xmlns), or better - iterate over the elements without explicitly stating xmlns?

You can define a generator to recursively search through your element tree in order to find tags which end with the appropriate tag name. For example, something like this: <pre class="prettyprint"><code>def get_element_by_tag(element, tag): if element.tag.endswith(tag): yield element for child in element: for g in get_element_by_tag(child, tag): yield g </code></pre> This just checks for tags which end with <code>tag</code>, i.e. ignoring any leading namespace. You can then iterate over any tag you want as follows: <pre class="prettyprint"><code>for item in get_element_by_tag(elemettree, 'technicalContact'): ... </code></pre> This generator in action: <pre class="prettyprint"><code>>>> xml_str = """<root xmlns="http://www.example.com"> ... <technicalContact>Test1</technicalContact> ... <technicalContact>Test2</technicalContact> ... </root> ... """ xml_etree = etree.fromstring(xml_str) >>> for item in get_element_by_tag(xml_etree, 'technicalContact') ... print item.tag, item.text ... {http://www.example.com}technicalContact Test1 {http://www.example.com}technicalContact Test2 </code></pre>

Python: Ignore xmlns in elementtree.ElementTree

Tags:

python

xml

xml-namespaces

elementtree

Is there a way to ignore the XML namespace in tage names in elementtree.ElementTree?

I try to print all technicalContact tags:

for item in root.getiterator(tag='{http://www.example.com}technicalContact'):
        print item.tag, item.text

And I get something like:

{http://www.example.com}technicalContact [email protected]

But what I really want is:

technicalContact [email protected]

Is there a way to display only the suffix (sans xmlns), or better - iterate over the elements without explicitly stating xmlns?

416

asked Jun 27 '12 12:06

Adam Matan

1 Answers

You can define a generator to recursively search through your element tree in order to find tags which end with the appropriate tag name. For example, something like this:

def get_element_by_tag(element, tag):
    if element.tag.endswith(tag):
        yield element
    for child in element:
        for g in get_element_by_tag(child, tag):
            yield g

This just checks for tags which end with tag, i.e. ignoring any leading namespace. You can then iterate over any tag you want as follows:

for item in get_element_by_tag(elemettree, 'technicalContact'):
    ...

This generator in action:

>>> xml_str = """<root xmlns="http://www.example.com">
... <technicalContact>Test1</technicalContact>
... <technicalContact>Test2</technicalContact>
... </root>
... """

xml_etree = etree.fromstring(xml_str)

>>> for item in get_element_by_tag(xml_etree, 'technicalContact')
...     print item.tag, item.text
... 
{http://www.example.com}technicalContact Test1
{http://www.example.com}technicalContact Test2

106

answered Oct 02 '22 16:10

Chris

Related questions
                            
                                Using lxml to parse namepaced HTML?
                            
                                How to prepare a dataset for Keras?
                            
                                Python scikit learn n_jobs
                            
                                Python glob.glob always returns empty list
                            
                                Python ggplot- ggsave function not defined
                            
                                Persistence Database(MySQL/MongoDB/Cassandra/BigTable/BigData) Vs Non-Persistence Array (PHP/PYTHON)
                            
                                Python - Cerberus, jsonschema, voluptous - Which one will be appropriate? [closed]
                            
                                Integrate Python based TensorFlow into a .NET application [closed]
                            
                                Access webcam using OpenCV (Python) in Docker?
                            
                                Why doesn't tempfile.SpooledTemporaryFile implement readable, writable, seekable?
                            
                                Duplicate log entries with Google Cloud Stackdriver logging of Python code on Kubernetes Engine
                            
                                pip download without executing setup.py
                            
                                How to install python-distutils for old python versions
                            
                                How to efficiently run multiple Pytorch Processes / Models at once ? Traceback: The paging file is too small for this operation to complete
                            
                                Why is there no speed-up when using pythons multiprocessing for embarassingly parallel problem within a for-loop, with shared numpy data?
                            
                                python setup.py develop to override installed version
                            
                                Parsing mbox files in Python
                            
                                python setup.py configuration to install files in custom directories
                            
                                pymongo connection pooling and client requests
                            
                                print a binary tree on its side

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With