I'm using lxml (2.2.8) to create and write out some XML (specifically XGMML). The app which will be reading it is apparently fairly fussy and wants to see a top level element with: <pre class="prettyprint"><code><graph label="Test" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:xlink="h ttp://www.w3.org/1999/xlink" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax- ns#" xmlns:cy="http://www.cytoscape.org" xmlns="http://www.cs.rpi.edu/XGMML" di rected="1"> </code></pre> How do I setup those <code>xmlns:</code> attributes with lxml ? If I try the obvious <pre class="prettyprint"><code>root.attrib['xmlns:dc']='http://purl.org/dc/elements/1.1/' root.attrib['xmlns:xlink']='http://www.w3.org/1999/xlink' root.attrib['xmlns:rdf']='http://www.w3.org/1999/02/22-rdf-syntax-ns#' root.attrib['xmlns:cy']='http://www.cytoscape.org' root.attrib['xmlns']='http://www.cs.rpi.edu/XGMML' </code></pre> lxml throws a <code>ValueError: Invalid attribute name u'xmlns:dc'</code> I've used XML and lxml a fair amount in the past for simple stuff, but managed to avoid needing to know anything about namespaces so far.

Unlike ElementTree or other serializers that would allow this, <code>lxml</code> needs you to set up these namespaces beforehand: <pre class="prettyprint"><code>NSMAP = {"dc" : 'http://purl.org/dc/elements/1.1', "xlink" : 'http://www.w3.org/1999/xlink'} root = Element("graph", nsmap = NSMAP) </code></pre> (and so on and so forth for the rest of the declarations) And then you can use the namespaces using their proper declarations: <pre class="prettyprint"><code>n = SubElement(root, "{http://purl.org/dc/elements/1.1}foo") </code></pre> Of course this gets annoying to type, so it is generally beneficial to assign the paths to short constant names: <pre class="prettyprint"><code>DCNS = "http://purl.org/dc/elements/1.1" </code></pre> And then use that variable in both the <code>NSMAP</code> and the <code>SubElement</code> declarations: <pre class="prettyprint"><code>n = SubElement(root, "{%s}foo" % (DCNS)) </code></pre>

How to write namespaced element attributes with LXML?

Tags:

python

xml-namespaces

lxml

cytoscape

I'm using lxml (2.2.8) to create and write out some XML (specifically XGMML). The app which will be reading it is apparently fairly fussy and wants to see a top level element with:

<graph label="Test" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:xlink="h
ttp://www.w3.org/1999/xlink" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-
ns#" xmlns:cy="http://www.cytoscape.org" xmlns="http://www.cs.rpi.edu/XGMML"  di
rected="1">

How do I setup those xmlns: attributes with lxml ? If I try the obvious

root.attrib['xmlns:dc']='http://purl.org/dc/elements/1.1/'
root.attrib['xmlns:xlink']='http://www.w3.org/1999/xlink'
root.attrib['xmlns:rdf']='http://www.w3.org/1999/02/22-rdf-syntax-ns#'
root.attrib['xmlns:cy']='http://www.cytoscape.org'
root.attrib['xmlns']='http://www.cs.rpi.edu/XGMML'

lxml throws a ValueError: Invalid attribute name u'xmlns:dc'

I've used XML and lxml a fair amount in the past for simple stuff, but managed to avoid needing to know anything about namespaces so far.

718

asked Oct 09 '11 10:10

timday

2 Answers

Unlike ElementTree or other serializers that would allow this, lxml needs you to set up these namespaces beforehand:

NSMAP = {"dc" : 'http://purl.org/dc/elements/1.1',
         "xlink" : 'http://www.w3.org/1999/xlink'}

root = Element("graph", nsmap = NSMAP)

(and so on and so forth for the rest of the declarations)

And then you can use the namespaces using their proper declarations:

n = SubElement(root, "{http://purl.org/dc/elements/1.1}foo")

Of course this gets annoying to type, so it is generally beneficial to assign the paths to short constant names:

DCNS = "http://purl.org/dc/elements/1.1"

And then use that variable in both the NSMAP and the SubElement declarations:

n = SubElement(root, "{%s}foo" % (DCNS))

178

answered Oct 22 '22 07:10

Nick Bastin

Using ElementMaker:

import lxml.etree as ET
import lxml.builder as builder
E = builder.ElementMaker(namespace='http://www.cs.rpi.edu/XGMML',
                         nsmap={None: 'http://www.cs.rpi.edu/XGMML',
                         'dc': 'http://purl.org/dc/elements/1.1/',
                         'xlink': 'http://www.w3.org/1999/xlink',
                         'rdf': 'http://www.w3.org/1999/02/22-rdf-syntax-ns#',
                         'cy': 'http://www.cytoscape.org', })
graph = E.graph(label="Test", directed="1")
print(ET.tostring(graph, pretty_print=True))

yields

<graph xmlns:cy="http://www.cytoscape.org" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns="http://www.cs.rpi.edu/XGMML" directed="1" label="Test"/>

answered Oct 22 '22 07:10

unutbu

Related questions
                            
                                Seaborn: annotate the linear regression equation
                            
                                How to plot two pandas time series on same plot with legends and secondary y-axis?
                            
                                Indicating multiple value in a Dict[] for type hints
                            
                                Python — check if a string contains Cyrillic characters
                            
                                How to crop or remove white background from an image
                            
                                pandas: dataframe to_csv, how to set column names
                            
                                How to run a coroutine outside of an event loop?
                            
                                Send message using Django Channels from outside Consumer class
                            
                                how overwrite Response class in django rest framework ( DRF )?
                            
                                How to hide secret keys in Google Colaboratory from users having the sharing link?
                            
                                Spacy nlp = spacy.load("en_core_web_lg")
                            
                                How do you express a Python Callable with no arguments?
                            
                                Why do NaN values make min and max sensitive to order? [duplicate]
                            
                                How would one implement Lazy Evaluation in C?
                            
                                Python doctest: Skip entire block?
                            
                                Can SQLAlchemy eager/joined loads be suppressed once set up?
                            
                                Start background process/daemon from CGI script
                            
                                TeX rendering, curly braces, and string formatting syntax in matplotlib
                            
                                STARTTLS extension not supported by server
                            
                                How to add builtin functions?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With