I am looking to edit XML files using python. I want to find and replace keywords in the tags. In the past, a co-worker had set up template XML files and used a "find and replace" program to replace these key words. I want to use python to find and replace these key words with values. I have been teaching myself the Elementtree module, but I am having trouble trying to do a find and replace. I have attached a snid-bit of my XML file. You will seen some variables surrounded by % (ie %SITEDESCR%) These are the words I want to replace and then save the XML to a new file. Any help or suggestions would be great. Thanks, Mike <pre class="prettyprint"><code><metadata> <idinfo> <citation> <citeinfo> <origin>My Company</origin> <pubdate>05/04/2009</pubdate> <title>POLYGONS</title> <geoform>vector digital data</geoform> <onlink>\\C$\ArcGISDevelopment\Geodatabase\PDA_STD_05_25_2009.gdb</onlink> </citeinfo> </citation> <descript> <abstract>This dataset represents the mapped polygons developed from the field data for the %SITEDESCR%.</abstract> <purpose>This dataset was created to accompany some stuff.</purpose> </descript> <timeperd> <timeinfo> <rngdates> <begdate>%begdate%</begdate> <begtime>unknown</begtime> <enddate>%enddate%</enddate> <endtime>unknown</endtime> </rngdates> </timeinfo> <current>ground condition</current> </timeperd> </code></pre>

The basics: <pre class="prettyprint"><code>from xml.etree import ElementTree as et tree = et.parse(datafile) tree.find('idinfo/timeperd/timeinfo/rngdates/begdate').text = '1/1/2011' tree.find('idinfo/timeperd/timeinfo/rngdates/enddate').text = '1/1/2011' tree.write(datafile) </code></pre> You can shorten the path if the tag name is unique. This syntax finds the first node at any depth level in the tree. <pre class="prettyprint"><code>tree.find('.//begdate').text = '1/1/2011' tree.find('.//enddate').text = '1/1/2011' </code></pre> Also, read the documentation, esp. the XPath support for locating nodes.

If you just want to replace the bits enclosed with <code>%</code>, then this isn't really an XML problem. You can easily do it with regex: <pre class="prettyprint"><code>import re xmlstring = open('myxmldocument.xml', 'r').read() substitutions = {'SITEDESCR': 'myvalue', ...} pattern = re.compile(r'%([^%]+)%') xmlstring = re.sub(pattern, lambda m: substitutions[m.group(1)], xmlstring) </code></pre>

Find and Replace Values in XML using Python

Tags:

python

find

replace

xml

I am looking to edit XML files using python. I want to find and replace keywords in the tags. In the past, a co-worker had set up template XML files and used a "find and replace" program to replace these key words. I want to use python to find and replace these key words with values. I have been teaching myself the Elementtree module, but I am having trouble trying to do a find and replace. I have attached a snid-bit of my XML file. You will seen some variables surrounded by % (ie %SITEDESCR%) These are the words I want to replace and then save the XML to a new file. Any help or suggestions would be great.

Thanks, Mike

<metadata> <idinfo> <citation> <citeinfo>  <origin>My Company</origin>  <pubdate>05/04/2009</pubdate>  <title>POLYGONS</title>  <geoform>vector digital data</geoform>  <onlink>\\C$\ArcGISDevelopment\Geodatabase\PDA_STD_05_25_2009.gdb</onlink> </citeinfo> </citation>  <descript>  <abstract>This dataset represents the mapped polygons developed from the field data for the %SITEDESCR%.</abstract>  <purpose>This dataset was created to accompany some stuff.</purpose>  </descript> <timeperd> <timeinfo> <rngdates>  <begdate>%begdate%</begdate>  <begtime>unknown</begtime>  <enddate>%enddate%</enddate>  <endtime>unknown</endtime>  </rngdates>  </timeinfo>  <current>ground condition</current>  </timeperd>

422

asked Jun 29 '11 16:06

Mike

2 Answers

The basics:

from xml.etree import ElementTree as et tree = et.parse(datafile) tree.find('idinfo/timeperd/timeinfo/rngdates/begdate').text = '1/1/2011' tree.find('idinfo/timeperd/timeinfo/rngdates/enddate').text = '1/1/2011' tree.write(datafile)

You can shorten the path if the tag name is unique. This syntax finds the first node at any depth level in the tree.

tree.find('.//begdate').text = '1/1/2011' tree.find('.//enddate').text = '1/1/2011'

Also, read the documentation, esp. the XPath support for locating nodes.

109

answered Sep 23 '22 00:09

Mark Tolonen

If you just want to replace the bits enclosed with %, then this isn't really an XML problem. You can easily do it with regex:

import re xmlstring = open('myxmldocument.xml', 'r').read() substitutions = {'SITEDESCR': 'myvalue', ...} pattern = re.compile(r'%([^%]+)%') xmlstring = re.sub(pattern, lambda m: substitutions[m.group(1)], xmlstring)

answered Sep 20 '22 00:09

Ismail Badawi

Related questions
                            
                                Python: Assign print output to a variable
                            
                                Python super() behavior not dependable
                            
                                Python dictionary search values for keys using regular expression
                            
                                Change a string of integers separated by spaces to a list of int
                            
                                Python dateutil.parser.parse parses month first, not day
                            
                                What is a "good" palette for divergent colors in R? (or: can viridis and magma be combined together?)
                            
                                How do I run another script in Python without waiting for it to finish? [duplicate]
                            
                                get class name for empty queryset in django
                            
                                How to restore a builtin that I overwrote by accident?
                            
                                See when packages were installed / updated using pip
                            
                                NotImplementedError: Layers with arguments in `__init__` must override `get_config`
                            
                                Get timer ticks in Python
                            
                                How can I pickle a dynamically created nested class in python?
                            
                                Point in Polygon with geoJSON in Python
                            
                                Django Test Client Method Override Header
                            
                                Merge a list of dataframes to create one dataframe [duplicate]
                            
                                Python scope: "UnboundLocalError: local variable 'c' referenced before assignment" [duplicate]
                            
                                RSS feed parser library in Python [closed]
                            
                                Python Replace \\ with \
                            
                                Django : Filter query based on custom function

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With