Alter namespace prefixing with ElementTree in Python

Tags:

By default, when you call ElementTree.parse(someXMLfile) the Python ElementTree library prefixes every parsed node with it's namespace URI in Clark's Notation:

    {http://example.org/namespace/spec}mynode

This makes accessing specific nodes by name a huge pain later in the code.

I've read through the docs on ElementTree and namespaces and it looks like the iterparse() function should allow me to alter the way the parser prefixes namespaces, but for the life of me I can't actually make it change the prefix. It seems like that may happen in the background before the ns-start event even fires as in this example:

for event, elem in iterparse(source):
    if event == "start-ns":
        namespaces.append(elem)
    elif event == "end-ns":
        namespaces.pop()
    else:
        ...

How do I make it change the prefixing behavior and what is the proper thing to return when the function ends?

598

asked Aug 08 '09 20:08

Gabriel Hurley

2 Answers

You don't specifically need to use iterparse. Instead, the following script:

from cStringIO import StringIO
import xml.etree.ElementTree as ET

NS_MAP = {
    'http://www.red-dove.com/ns/abc' : 'rdc',
    'http://www.adobe.com/2006/mxml' : 'mx',
    'http://www.red-dove.com/ns/def' : 'oth',
}

DATA = '''<?xml version="1.0" encoding="utf-8"?>
<rdc:container xmlns:mx="http://www.adobe.com/2006/mxml"
                 xmlns:rdc="http://www.red-dove.com/ns/abc"
                 xmlns:oth="http://www.red-dove.com/ns/def">
  <mx:Style>
    <oth:style1/>
  </mx:Style>
  <mx:Style>
    <oth:style2/>
  </mx:Style>
  <mx:Style>
    <oth:style3/>
  </mx:Style>
</rdc:container>'''

tree = ET.parse(StringIO(DATA))
some_node = tree.getroot().getchildren()[1]
print ET.fixtag(some_node.tag, NS_MAP)
some_node = some_node.getchildren()[0]
print ET.fixtag(some_node.tag, NS_MAP)

produces

('mx:Style', None)
('oth:style2', None)

Which shows how you can access the fully-qualified tag names of individual nodes in a parsed tree. You should be able to adapt this to your specific needs.

answered Sep 21 '22 06:09

Vinay Sajip

xml.etree.ElementTree doesn't appear to have fixtag, well, not according to the documentation. However I've looked at some source code for fixtag and you do:

import xml.etree.ElementTree as ET

for event, elem in ET.iterparse(inFile, events=("start", "end")):
    namespace, looktag = string.split(elem.tag[1:], "}", 1)

You have the tag string in looktag, suitable for a lookup. The namespace is in namespace.

answered Sep 20 '22 06:09

elves

Related questions
                            
                                Pros and cons of 'script' vs. 'entry_point' in Python command line scripts
                            
                                Using Tweepy to listen to stream and search for tweets. How to stop previous search and only listen for new stream?
                            
                                Can we use apps.py for application-level configuration as a contrast to settings.py for project-level configurations?
                            
                                Multiple sessions and graphs in Tensorflow (in the same process)
                            
                                pyGame full core usage in simple loop
                            
                                Conda: Choose where packages are downloaded
                            
                                Understanding Pycharm's profiler's results vs. cProfile results and how to get more detail on standard library functions
                            
                                Training a tf.keras model with a basic low-level TensorFlow training loop doesn't work
                            
                                How to efficiently use asyncio when calling a method on a BaseProxy?
                            
                                PyQt vs PySide comparison [closed]
                            
                                How to delete a record from table?
                            
                                What are some good ways of estimating 'approximate' semantic similarity between sentences?
                            
                                Define remote interpreter on remote Linux machine using Pydev and RSE Server
                            
                                Jinja2: How to use named blocks inside included templates, inside extendable template
                            
                                How to perform a chi-squared goodness of fit test using scientific libraries in Python?
                            
                                Compute the gradient of the SVM loss function
                            
                                Sampling n= 2000 from a Dask Dataframe of len 18000 generates error Cannot take a larger sample than population when 'replace=False'
                            
                                Interactive matplotlib using ipywidgets
                            
                                Where are the gains using numba coming from for pure numpy code?
                            
                                Cache Julia module for faster startup and usage in Python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Alter namespace prefixing with ElementTree in Python

Tags:

python

namespaces

xml

elementtree

Gabriel Hurley

People also ask

2 Answers

Vinay Sajip

elves

Recent Activity

Donate For Us