My XML file looks like the following: <pre class="prettyprint"><code><?xml version="1.0"?> <ItemSearchResponse xmlns="http://webservices.amazon.com/AWSECommerceService/2008-08-19"> <Items> <Item> <ItemAttributes> <ListPrice> <Amount>2260</Amount> </ListPrice> </ItemAttributes> <Offers> <Offer> <OfferListing> <Price> <Amount>1853</Amount> </Price> </OfferListing> </Offer> </Offers> </Item> </Items> </ItemSearchResponse> </code></pre> All I want to do is extract the ListPrice. This is the code I am using: <pre class="prettyprint"><code>>> from elementtree import ElementTree as ET >> fp = open("output.xml","r") >> element = ET.parse(fp).getroot() >> e = element.findall('ItemSearchResponse/Items/Item/ItemAttributes/ListPrice/Amount') >> for i in e: >> print i.text >> >> e >> </code></pre> Absolutely no output. I also tried <pre class="prettyprint"><code>>> e = element.findall('Items/Item/ItemAttributes/ListPrice/Amount') </code></pre> No difference. What am I doing wrong?

There are 2 problems that you have. 1) <code>element</code> contains only the root element, not recursively the whole document. It is of type Element not ElementTree. 2) Your search string needs to use namespaces if you keep the namespace in the XML. To fix problem #1: You need to change: <pre class="prettyprint"><code>element = ET.parse(fp).getroot() </code></pre> to: <pre class="prettyprint"><code>element = ET.parse(fp) </code></pre> To fix problem #2: You can take off the xmlns from the XML document so it looks like this: <pre class="prettyprint"><code><?xml version="1.0"?> <ItemSearchResponse> <Items> <Item> <ItemAttributes> <ListPrice> <Amount>2260</Amount> </ListPrice> </ItemAttributes> <Offers> <Offer> <OfferListing> <Price> <Amount>1853</Amount> </Price> </OfferListing> </Offer> </Offers> </Item> </Items> </ItemSearchResponse> </code></pre> With this document you can use the following search string: <pre class="prettyprint"><code>e = element.findall('Items/Item/ItemAttributes/ListPrice/Amount') </code></pre> The full code: <pre class="prettyprint"><code>from elementtree import ElementTree as ET fp = open("output.xml","r") element = ET.parse(fp) e = element.findall('Items/Item/ItemAttributes/ListPrice/Amount') for i in e: print i.text </code></pre> Alternate fix to problem #2: Otherwise you need to specify the xmlns inside the srearch string for each element. The full code: <pre class="prettyprint"><code>from elementtree import ElementTree as ET fp = open("output.xml","r") element = ET.parse(fp) namespace = "{http://webservices.amazon.com/AWSECommerceService/2008-08-19}" e = element.findall('{0}Items/{0}Item/{0}ItemAttributes/{0}ListPrice/{0}Amount'.format(namespace)) for i in e: print i.text </code></pre> <hr> Both print: <blockquote> 2260 </blockquote>

Using XPath in ElementTree

Tags:

python

xml

xpath

elementtree

My XML file looks like the following:

Click to copy

<?xml version="1.0"?> <ItemSearchResponse xmlns="http://webservices.amazon.com/AWSECommerceService/2008-08-19">   <Items>     <Item>       <ItemAttributes>         <ListPrice>           <Amount>2260</Amount>         </ListPrice>       </ItemAttributes>       <Offers>         <Offer>           <OfferListing>             <Price>               <Amount>1853</Amount>             </Price>           </OfferListing>         </Offer>       </Offers>     </Item>   </Items> </ItemSearchResponse>

All I want to do is extract the ListPrice.

This is the code I am using:

Click to copy

>> from elementtree import ElementTree as ET >> fp = open("output.xml","r") >> element = ET.parse(fp).getroot() >> e = element.findall('ItemSearchResponse/Items/Item/ItemAttributes/ListPrice/Amount') >> for i in e: >>    print i.text >> >> e >>

Absolutely no output. I also tried

Click to copy

>> e = element.findall('Items/Item/ItemAttributes/ListPrice/Amount')

No difference.

What am I doing wrong?

332

asked Aug 23 '09 19:08

Ryan R. Rosario

1 Answers

There are 2 problems that you have.

1) element contains only the root element, not recursively the whole document. It is of type Element not ElementTree.

2) Your search string needs to use namespaces if you keep the namespace in the XML.

To fix problem #1:

You need to change:

Click to copy

element = ET.parse(fp).getroot()

to:

Click to copy

element = ET.parse(fp)

To fix problem #2:

You can take off the xmlns from the XML document so it looks like this:

Click to copy

<?xml version="1.0"?> <ItemSearchResponse>   <Items>     <Item>       <ItemAttributes>         <ListPrice>           <Amount>2260</Amount>         </ListPrice>       </ItemAttributes>       <Offers>         <Offer>           <OfferListing>             <Price>               <Amount>1853</Amount>             </Price>           </OfferListing>         </Offer>       </Offers>     </Item>   </Items> </ItemSearchResponse>

With this document you can use the following search string:

Click to copy

e = element.findall('Items/Item/ItemAttributes/ListPrice/Amount')

The full code:

Click to copy

from elementtree import ElementTree as ET fp = open("output.xml","r") element = ET.parse(fp) e = element.findall('Items/Item/ItemAttributes/ListPrice/Amount') for i in e:   print i.text

Alternate fix to problem #2:

Otherwise you need to specify the xmlns inside the srearch string for each element.

The full code:

Click to copy

from elementtree import ElementTree as ET fp = open("output.xml","r") element = ET.parse(fp)  namespace = "{http://webservices.amazon.com/AWSECommerceService/2008-08-19}" e = element.findall('{0}Items/{0}Item/{0}ItemAttributes/{0}ListPrice/{0}Amount'.format(namespace)) for i in e:     print i.text

Both print:

2260

137

answered Sep 17 '22 18:09

Brian R. Bondy

Related questions
                            
                                Is it possible to change the model name in the django admin site?
                            
                                Slicing a list into n nearly-equal-length partitions [duplicate]
                            
                                Django and query string parameters
                            
                                Why doesn't Python evaluate constant number arithmetic before compiling to bytecode?
                            
                                Appending two dataframes with same columns, different order
                            
                                Overloading __dict__() on python class
                            
                                pandas groupby and join lists
                            
                                Reading input sound signal using Python
                            
                                How to install dependencies from a copied pipfile inside a virtual environment?
                            
                                How to exit when viewing python help like help(os.listdir)
                            
                                Function with varying number of For Loops (python)
                            
                                Saving numpy array to txt file row wise
                            
                                python regex first/shortest match
                            
                                How do I test dictionary-equality with Python's doctest-package?
                            
                                setting up s3 for logs in airflow
                            
                                How to output CDATA using ElementTree
                            
                                Creating dummy variables in pandas for python
                            
                                set very low values to zero in numpy
                            
                                Pandas Writing Dataframe Columns to csv
                            
                                how to read certain columns from Excel using Pandas - Python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Using XPath in ElementTree

Tags:

python

xml

xpath

elementtree

Ryan R. Rosario

People also ask

1 Answers

Brian R. Bondy

Recent Activity

Donate For Us