Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Remove ns0 from XML

Tags:

python

xml

lxml

I have an XML file where I would like to edit certain attributes. I am able to properly edit the attributes but when I write the changes to the file, the tags have a strange "ns0" added onto them. How can I get rid of this? This is what I have tried and have been unsuccessful. I am working in Python and using lxml.

import xml.etree.ElementTree as ET
from xml.etree import ElementTree as etree
from lxml import etree, objectify
frag_xml_tree = ET.parse(xml_name)
frag_root = frag_xml_tree.getroot()

for e in frag_root: 
    for elem in frag_root.iter(e):
        elem.attrib[frag_param_name] = update_val
    etree.register_namespace("", "http://www.w3.org/2001")
    frag_xml_tree.write(xml_name)

However, when I do this, I only get the error Invalid tag name u''. I thought this error came up if the xml tags started with digits but that is not the case with my xml. I am really stuck on how to proceed. Thanks

like image 668
user4500293 Avatar asked Aug 01 '16 21:08

user4500293


2 Answers

Actually the way to do it seemed to be a combination of two things.

  1. The import statement is import xml.etree.ElementTree as ET
  2. ET.register_namespace("", NAMESPACE) is the correct call, where NAMESPACE is the namespace listed in the input xml, ie the url after xmlns.
like image 183
user4500293 Avatar answered Sep 19 '22 23:09

user4500293


Here's the corrected code using only xml.etree.ElementTree instead of lxml:

import xml.etree.ElementTree as ET
frag_xml_tree = ET.parse(xml_name)
frag_root = frag_xml_tree.getroot()

for e in frag_root: 
    for elem in frag_root.iter(e):
        elem.attrib[frag_param_name] = update_val
    ET.register_namespace("", "http://www.w3.org/2001")
    frag_xml_tree.write(xml_name)
like image 23
Josh Correia Avatar answered Sep 20 '22 23:09

Josh Correia